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J? CROSS-REFERENCE TO RELATED APPLICATIONS 



%\ USEFUL IN TREATMENT AND DETECTION OF CANCER 



NUCLEIC ACID AND CORRESPONDING PROTEIN ENTITLED 121P1F1 



This application is a continuation-in-part of United States Serial No. 09/779,250 filed March 5, 2001. The 



5 content of that application is hereby incorporated by reference herein in its entirety. 
FIELD OF THE INVENTION 

The invention described herein relates to a gene and its encoded protein, termed 121P1F1, expressed in 
certain cancers, and to diagnostic and therapeutic methods and compositions useful in the management of cancers 
that express 121P1F1. 



BACKGROUND OF THE INVENTION 

Cancer is the second leading cause of human death next to coronary disease. Worldwide, millions of 
people die from cancer every year. In the United States alone, as reported by the American Cancer Society, cancer 
causes the death of well over a half-million people annually, with over 1.2 million new cases diagnosed per year. 
15 While deaths from heart disease have been declining significantly, those resulting from cancer generally are on the 
rise. In the early part of the next century, cancer is predicted to become the leading cause of death. 

Worldwide, several cancers stand out as the leading killers. In particular, carcinomas of the lung, prostate, 
breast, colon, pancreas, and ovary represent the primary causes of cancer death. These and virtually all other 
carcinomas share a common lethal feature. With very few exceptions, metastatic disease from a carcinoma is fatal. 
20 Moreover, even for those cancer patients who initially survive their primary cancers, common experience has 
shown that their lives are dramatically altered. Many cancer patients experience strong anxieties driven by the 
awareness of the potential for recurrence or treatment failure. Many cancer patients experience physical 
debilitations following treatment. Furthermore, many cancer patients experience a recurrence. 



25 Europe, it is by far the most common cancer in males and is the second leading cause of cancer death in men. In the 
United States alone, well over 30,000 men die annually of this disease - second only to lung cancer. Despite the 
magnitude of these figures, there is still no effective treatment for metastatic prostate cancer. Surgical 
prostatectomy, radiation therapy, hormone ablation therapy, surgical castration and chemotherapy continue to be the 
main treatment modalities. Unfortunately, these treatments are ineffective for many and are often associated with 

30 undesirable consequences. 

On the diagnostic front, the lack of a prostate tumor marker that can accurately detect early-stage, localized 
tumors remains a significant limitation in the diagnosis and management of this disease. Although the serum 
prostate specific antigen (PSA) assay has been a very useful tool, however its specificity and general utility is 
widely regarded as lacking in several important respects. 

35 Progress in identifying additional specific markers for prostate cancer has been improved by the generation 

of prostate cancer xenografts that can recapitulate different stages of the disease in mice. The LAPC (Los Angeles 
Prostate Cancer) xenografts are prostate cancer xenografts that have survived passage in severe combined immune 
deficient (SCID) mice and have exhibited the capacity to mimic the transition from androgen dependence to 
androgen independence (Klein et ai. 3 1997, Nat. Med. 3:402). More recently identified prostate cancer markers. 

40 include PCTA-1 (Su et ai, 1996, Proc. Natl. Acad. Sci. USA 93: 7252), prostate-specific membrane (PSM) antigen 
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(Pinto et aL, Clin Cancer Res 1996 Sep 2 (9): 1445-51), STEAP (Hubert, et aL y Proc Natl Acad Sci USA. 1999 
Dec 7; 96(25): 14523-8) and prostate stem cell antigen (PSCA) (Reiter et al., 1998, Proc. Natl. Acad. Sci. USA 95: 
1735). 

While previously identified markers such as PSA, PSM, PCTA and PSCA have facilitated efforts to 
5 diagnose and treat prostate cancer, there is need for the identification of additional markers and therapeutic targets 
for prostate and related cancers in order to further improve diagnosis and therapy. 

Renal cell carcinoma (RCC) accounts for approximately 3 percent of adult malignancies. Once adenomas 
reach a diameter of 2 to 3 cm, malignant potential exists. In the adult, the two principal malignant renal tumors are 
renal cell adenocarcinoma and transitional cell carcinoma of the renal pelvis or ureter. The incidence of renal cell 
10 adenocarcinoma is estimated at more than 29,000 cases in the United States, and more than 1 1,600 patients died of 
this disease in 1998. Transitional cell carcinoma is less frequent, with an incidence of approximately 500 cases per 
year in the United States. 

Surgery has been the primary therapy for renal cell adenocarcinoma for many decades. Until recently, 
metastatic disease has been refractory to any systemic therapy. With recent developments in systemic therapies, 
15 particularly immunotherapies, metastatic renal cell carcinoma may be approached aggressively in appropriate 

patients with a possibility of durable responses. Nevertheless, there is a remaining need for effective therapies for 
these patients. 

Of all new cases of cancer in the United States, bladder cancer represents approximately 5 percent in men 
(fifth most common neoplasm) and 3 percent in women (eighth most common neoplasm). The incidence is 

20 increasing slowly, concurrent with an increasing older population. In 1998, there was an estimated 54,500 cases, 
including 39,500 in men and 15,000 in women. The age-adjusted incidence in the United States is 32 per 100,000 
for men and 8 per 100,000 in women. The historic male/female ratio of 3: 1 may be decreasing related to smoking 
patterns in women. There were an estimated 1 1,000 deaths from bladder cancer in 1998 (7,800 in men and 3,900 in 
women). Bladder cancer incidence and mortality strongly increase with age and will be an increasing problem as 

25 the population becomes more elderly. 

Most bladder cancers recur in the bladder. Bladder cancer is managed with a combination of transurethral 
resection of the bladder (TUR) and intravesical chemotherapy or immunotherapy. The multifocal and recurrent 
nature of bladder cancer points out the limitations of TUR. Most muscle-invasive cancers are not cured by TUR 
alone. Radical cystectomy and urinary diversion is the most effective means to eliminate the cancer but carry an 

30 undeniable impact on urinary and sexual function. There continues to be a significant need for treatment modalities 
that are beneficial for bladder cancer patients. 

An estimated 130,200 cases of colorectal cancer occurred in 2000 in the United States, including 93,800 
cases of colon cancer and 36,400 of rectal cancer. Colorectal cancers are the third most common cancers in men 
and women. Incidence rates declined significantly during 1992-1996 (-2.1% per year). Research suggests that 

35 these declines have been due to increased screening and polyp removal, preventing progression of polyps to 

invasive cancers. There were an estimated 56,300 deaths (47,700 from colon cancer, 8,600 from rectal cancer) in 
2000, accounting for about 1 1% of all U.S. cancer deaths. 

At present, surgery is the most common form of therapy for colorectal cancer, and for cancers that have not 
spread, it is frequently curative. Chemotherapy, or chemotherapy plus radiation, is given before or after surgery to 

40 most patients whose cancer has deeply perforated the bowel wall or has spread to the lymph nodes. A permanent 
colostomy (creation of an abdominal opening for elimination of body wastes) is occasionally needed for colon 
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cancer and is infrequently required for rectal cancer. There continues to be a need for effective diagnostic and 
treatment modalities for colorectal cancer. 

There were an estimated 164,100 new cases of lung and bronchial cancer in 2000, accounting for 14% of 
all U.S. cancer diagnoses. The incidence rate of lung and bronchial cancer is declining significantly in men, from a 
5 high of 86.5 per 100,000 in 1984 to 70.0 in 1996. In the 1990s, the rate of increase among women began to slow. 
In 1996, the incidence rate in women was 42.3 per 100,000. 

Lung and bronchial cancer caused an estimated 156,900 deaths in 2000, accounting for 28% of all cancer 
deaths. During 1992-1996, mortality from lung cancer declined significantly among men (-1.7% per year) while 
rates for women were still significantly increasing (0.9% per year). Since 1987, more women have died each year 

1 0 of lung cancer than breast cancer, which, for over 40 years, was the major cause of cancer death in women. 

Decreasing lung cancer incidence and mortality rates most likely resulted from decreased smoking rates oyer the 
previous 30 years; however, decreasing smoking patterns among women lag behind those of men. Of concern, 
although the declines in adult tobacco use have slowed, tobacco use in youth is increasing again. 

Treatment options for lung and bronchial cancer are determined by the type and stage of the cancer and 

15 include surgery, radiation therapy, and chemotherapy. For many localized cancers, surgery is usually the treatment 
of choice. Because the disease has usually spread by the time it is discovered, radiation therapy and chemotherapy 
are often needed in combination with surgery. Chemotherapy alone or combined with radiation is the treatment of 
choice for small cell lung cancer; on this regimen, a large percentage of patients experience remission, which in 
some cases is long lasting. There is however, an ongoing need for effective treatment and diagnostic approaches for 

20 lung and bronchial cancers. 

An estimated 182,800 new invasive cases of breast cancer were expected to occur among women in the 
United States during 2000. Additionally, about 1,400 new cases of breast cancer were expected to be diagnosed in 
men in 2000. After increasing about 4% per year in the 1980s, breast cancer incidence rates in women have leveled 
off in the 1990s to about 1 10.6 cases per 100,000. 

25 In the U.S. alone, there were an estimated 41,200 deaths (40,800 women, 400 men) in 2000 due to breast 

cancer. Breast cancer ranks second among cancer deaths in women. According to the most recent data, mortality 
rates declined significantly during 1992-1996 with the largest decreases in younger women, both white and black. 
These decreases were probably the result of earlier detection and improved treatment. 

Taking into account the medical circumstances and the patient's preferences, treatment of breast cancer 

30 may involve lumpectomy (local removal of the tumor) and removal of the lymph nodes under the arm; mastectomy 
(surgical removal of the breast) and removal of the lymph nodes under the arm; radiation therapy; chemotherapy; or 
hormone therapy. Often, two or more methods are used in combination. Numerous studies have shown that, for 
early stage disease, long-term survival rates after lumpectomy plus radiotherapy are similar to survival rates after 
modified radical mastectomy. Significant advances in reconstruction techniques provide several options for breast 

35 reconstruction after mastectomy. Recently, such reconstruction has been done at the same time as the mastectomy. 

Local excision of ductal carcinoma in situ (DCIS) with adequate amounts of surrounding normal breast 
tissue may prevent the local recurrence of the DCIS. Radiation to the breast and/or tamoxifen may reduce the 
chance of DCIS occurring in the remaining breast tissue. This is important because DCIS, if left untreated, may 
develop into invasive breast cancer. Nevertheless, there are serious side effects or sequelae to these treatments. 

40 There is, therefore, a need for efficacious breast cancer treatments. 

There were an estimated 23,100 new cases of ovarian cancer in the United States in 2000. It accounts for 
4% of all cancers among women and ranks second among gynecologic cancers. During 1992-1996, ovarian cancer 
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incidence rates were significantly declining. Consequent to ovarian cancer, there were an estimated 14,000 deaths 
in 2000. Ovarian cancer causes more deaths than any other cancer of the female reproductive system. 

Surgery, radiation therapy, and chemotherapy are treatment options for ovarian cancer. Surgery usually 
includes the removal of one or both ovaries, the fallopian tubes (salpingo-oophorectomy), and the uterus 
5 (hysterectomy). In some very early tumors, only the involved ovary will be removed, especially in young women 
who wish to have children. In advanced disease, an attempt is made to remove all intra-abdominal disease to 
enhance the effect of chemotherapy. There continues to be an important need for effective treatment options for 
ovarian cancer. 

There were an estimated 28,300 new cases of pancreatic cancer in the United States in 2000. Over the past 
10 20 years, rates of pancreatic cancer have declined in men. Rates among women have remained approximately 
constant but may be beginning to decline. Pancreatic cancer caused an estimated 28,200 deaths in 2000 in the 
United States. Over the past 20 years, there has been a slight but significant decrease in mortality rates among men 
(about -0.9% per year) while rates have increased slightly among women. 

Surgery, radiation therapy, and chemotherapy are treatment options for pancreatic cancer. These treatment 
1 5 options can extend survival and/or relieve symptoms in many patients but are not likely to produce a cure for most. 
There is a significant need for additional therapeutic and diagnostic options for pancreatic cancer. 



SUMMARY OF THE INVENTION 

The present invention relates to a gene, designated 121P1F1, that has now been found to be over-expressed 

20 in the cancer(s) listed in Table I. Northern blot expression analysis of 121 PI Fl gene expression in normal tissues 
shows a restricted expression pattern in adult tissues. The nucleotide (Figure 2) and amino acid (Figure 2, and 
Figure 3) sequences of 121 PI Fl are provided. The tissue-related profile of 121 PI Fl in normal adult tissues, 
combined with the over-expression observed in the tumors listed in Table I, shows that 12 1 PI Fl is aberrantly over- 
expressed in at least some cancers, and thus serves as a useful diagnostic, prophylactic, prognostic, and/or 

25 therapeutic target for cancers of the tissue(s) such as those listed in Table I. 

The invention provides polynucleotides corresponding or complementary to all or part of the 121P1F1 
genes, mRNAs, and/or coding sequences, preferably in isolated form, including polynucleotides encoding 121P1F1- 
related proteins and fragments of 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24, 25, or 
more than 25 contiguous amino acids; at least 30, 35, 40, 45, 50, 55, 60, 65, 70, 80, 85, 90, 95, 100 or more than 

30 100 contiguous amino acids of a 12 1P1F1 -related protein, as well as the peptides/proteins themselves; DNA, RNA, 
DNA/RNA hybrids, and related molecules, polynucleotides or oligonucleotides complementary or having at least a 
90% homology to the 121 PI Fl genes or mRNA sequences or parts thereof, and polynucleotides or oligonucleotides 
that hybridize to the 1 2 1 P 1 F 1 genes, mRNAs, or to 1 2 1 P 1 F 1 -encoding polynucleotides. Also provided are means 
for isolating cDN As and the genes encoding 1 2 1 P 1 F 1 . Recombinant DNA molecules containing 1 2 1 P 1 F 1 

35 polynucleotides, cells transformed or transduced with such molecules, and host- vector systems for the expression of 
12 1 PI Fl gene products are also provided. The invention further provides antibodies that bind to 121P1F1 proteins 
and polypeptide fragments thereof, including polyclonal and monoclonal antibodies, murine and other mammalian 
antibodies, chimeric antibodies, humanized and fully human antibodies, and antibodies labeled with a detectable 
marker or therapeutic agent. In certain embodiments there is a proviso that the entire nucleic acid sequence of 

40 Figure 2 is not encoded and/or the entire amino acid sequence of Figure 2 is not prepared. In certain embodiments, 
the entire nucleic acid sequence of Figure 2 is encoded and/or the entire amino acid sequence of Figure 2 is 
prepared, either of which are in respective human unit dose forms. 



511582003440 



The invention further provides methods for detecting the presence and status of 121 PI Fl polynucleotides 
and proteins in various biological samples, as well as methods for identifying cells that express 121P1F1. A typical 
embodiment of this invention provides methods for monitoring 121P1F1 gene products in a tissue or hematology 
sample having or suspected of having some form of growth dysregulation such as cancer. 
5 The invention further provides various immunogenic or therapeutic compositions and strategies for 

treating cancers that express 121P1F1 such as cancers of tissues listed in Table I, including therapies aimed at 
inhibiting the transcription, translation, processing or function of 121 PI Fl as well as cancer vaccines. In one 
aspect, the invention provides compositions, and methods comprising them, for treating a cancer that expresses 
121P1F1 in a human subject wherein the composition comprises a carrier suitable for human use and a human unit 

10 dose of one or more than one agent that inhibits the production or function of 121 PI Fl. Preferably, the carrier is a 
uniquely human carrier. In another aspect of the invention, the agent is a moiety that is immunoreactive with 
121P1F1 protein. Non-limiting examples of such moieties include, but are not limited to, antibodies (such as single 
chain, monoclonal, polyclonal, humanized, chimeric, or human antibodies), functional equivalents thereof (whether 
naturally occurring or synthetic), and combinations thereof. The antibodies can be conjugated to a diagnostic or 

15 therapeutic moiety. In another aspect, the agent is a small molecule as defined herein. 

In another aspect, the agent comprises one or more than one peptide which comprises a cytotoxic T 
lymphocyte (CTL) epitope that binds an HLA class I molecule in a human to elicit a CTL response to 121P1F1 
and/or one or more than one peptide which comprises a helper T lymphocyte (HTL) epitope which binds an HLA 
class II molecule in a human to elicit an HTL response. The peptides of the invention may be on the same or on one 

20 or more separate polypeptide molecules. In a further aspect of the invention, the agent comprises one or more than 
one nucleic acid molecule that expresses one or more than one of the CTL or HTL response stimulating peptides as 
described above. In yet another aspect of the invention, the one or more than one nucleic acid molecule may 
express a moiety that is immunologically reactive with 121P1F1 as described above. The one or more than one 
nucleic acid molecule may also be, or encodes, a molecule that inhibits production of 121 PI Fl. Non-limiting 

25 examples of such molecules include, but are not limited to, those complementary to a nucleotide sequence essential 
for production of 121 PI Fl (e.g. antisense sequences or molecules that form a triple helix with a nucleotide double 
helix essential for 121 PI Fl production) or a ribozy me effective to lyse 121P1F1 mRNA. 

BRIEF DESCRIPTION OF THE FIGURES 

30 

Figure 1. The 121P1F1 SSH sequence of 254 nucleotides. 

Figure 2. The cDNA (SEQ ID. NO. : ) and amino acid sequence (SEQ ID. NO. : ) of 121 PI Fl is 
shown in Figure 2A. The start methionine is underlined. The open reading frame extends from nucleic acid 82-699 
including the stop codon. The nucleic acid and amino acid sequence of 121 PI Fl variant 1 A (SEQ ID. NO. : — ) 

35 is shown in Figure 2B, the codon for the start methionine is underlined. The open reading frame for variant 1 A 
extends from nucleic acid 82 to 462 including the stop codon. The nucleic acid and amino acid sequence of 
121P1F1 variant IB (SEQ I D. NO. : ) is shown in Figure 2C, the codon for the start methionine is underlined. 
The open reading frame for variant IB extends from nucleic acid 501-860 including the stop codon. The nucleic 
acid and amino acid sequence of 121 PI Fl variant 2 (SEQ ID. NO. : ) is shown in Figure 2D, the codon for the 

40 start methionine is underlined. The open reading frame for variant 2 extends from nucleic acid 82-450 including the 
stop codon. The nucleic acid and amino acid sequence of 121 PI Fl variant 3 (SEQ I D. NO. : is shown in 
Figure 2E, the codon for the start methionine is underlined. The open reading frame for variant 3 extends from 
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nucleic acid 82-654 including the stop codon. The nucleic acid and amino acid sequence of 121 PI Fl variant 4 
(SEQ1D. NO. : ) is shown in Figure 2F, the codon for the start methionine is underlined. The open reading 
frame for variant 4 extends from nucleic acid 281-853 including the stop codon. 

Figure 3. Amino acid sequence of 121 PI Fl (SEQ ID. NO. : ) is shown in Figure 3 A; it has 205 
5 amino acids. The amino acid sequence of 121P1F1 variant 1 A (SEQ1D. NO. : ) is shown in Figure 3B; it has 
126 amino acids. The amino acid sequence of 121 PI Fl variant IB (SEQ1D. NO. : ) is shown in Figure 3C, the 
121P1F1 variant IB protein has 1 19 amino acids. The amino acid sequence of 121 PI Fl variant 2 (SEQ I D. NO. 
: ) is shown in Figure 3D, the 12 1 PI Fl variant 2 protein has 122 amino acids. The amino acid sequence of 
121P1F1 variant 3 (SEQ ID. NO. : ) is shown in Figure 3E, the 121P1F1 variant 3 protein has 190 amino acids. 
10 The amino acid sequence of 1 2 1 P 1 F 1 variant 4 (SEQ ID. NO. : ) is shown in Figure 3 F, the 1 2 1 P 1 F 1 variant 4 
protein has 190 amino acids. 

Figure 4. A. The amino acid alignments of 121 PI Fl protein and variants 1 A, IB, 2, and 3. B. The 
amino acid alignments of 121P1F1 protein and variants 4 and 1 A. C. Alignment with human protein GAJ. D. 
Alignment with closest mouse homolog. E. Alignment with hypothetical yeast protein. 
15 Figure 5. Hydrophilicity amino acid profile of A) 121P1F1 and B) 121P1F1 varlA determined by 

computer algorithm sequence analysis using the method of Hopp and Woods (Hopp T.P., Woods K.R., 1981. Proc. 
Natl. Acad. Sci. U.S.A. 78:3824-3828) accessed on the Protscale website located on the World Wide Web at 
(www.expasy.ch/cgi-bin/protscale.pl) through the ExPasy molecular biology server. 

Figure 6. Hydropathicity amino acid profile of A) 121P1F1 and B) 121P1F1 varl A determined by 
20 computer algorithm sequence analysis using the method of Kyte and Doolittle (Kyte J., Doolittle R.F., 1982. J. Mol. 
Biol. 157:105-132) accessed on the ProtScale website located on the World Wide Web at (www.expasy.ch/cgi- 
bin/protscale.pl) through the ExPasy molecular biology server. 

Figure 7. Percent accessible residues amino acid profile of A) 121P1F1 and B) 121P1F1 varl A 
determined by computer algorithm sequence analysis using the method of Janin (Janin J., 1979 Nature 277:491- 
25 492) accessed on the ProtScale website located on the World Wide Web at (www.expasy.ch/cgi-bin/protscale.pl) 
through the ExPasy molecular biology server. 

Figure 8. Average flexibility amino acid profile of A) 121P1F1 and B) 121P1F1 varl A determined by 
computer algorithm sequence analysis using the method of Bhaskaran and Ponnuswamy (Bhaskaran R., and 
Ponnuswamy P.K., 1988. Int. J. Pept. Protein Res. 32:242-255) accessed on the ProtScale website located on the 
30 World Wide Web at (www.expasy.ch/cgi-bin/protscale. pi) through the ExPasy molecular biology server. 

Figure 9. Beta-turn amino acid profile of A) 121P1 Ft and B) 121P1F1 varlA determined by computer 
algorithm sequence analysis using the method of Deleage and Roux (Deleage, G., Roux B. 1987 Protein 
Engineering 1 :289-294) accessed on the ProtScale website located on the World Wide Web at (www.expasyxh/cgi- 
bin/protscale.pl) through the ExPasy molecular biology server. 
35 Figure 10. Nucleotide splice variants of 121 PI Fl . 

Figure 11. Protein splice variants of 121P1F1. 

Figure 12: Specific recognition of 121P1F1 antigen by anti-121PlFl polyclonal antibody. The indicated 
dilutions of anti-121PlFl polyclonal antibody serum or pre-immune serum was used to probe a blot containing 
GST-121P1F1 cleavage antigen. Reactivity was visualized by incubation with goat anti-rabbit HRP-conjugated 
40 secondary antibody and development by enhanced chemiluminescence and exposure to autoradiography film. 

Figure 13: Expression of 121 Pi Fl in various cancer cells. Anti-12 1 PI Fl polyclonal antibody was used 
to carry out Western blot analysis of 121 PI Fl expression in cell lysates from the indicated cancer cell lines and 
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Myc His tagged 121P1F1 expressed in 293T cells. Seen is specific anti-121PlFl reactive bands in each of the 
cancer cell lines indicative of endogenous 121P1F1 expression and possibly recognition of 121 PI Fl splice variants 
of different molecular weights. 

Figure 14: Expression of 1 2 1 P 1 F 1 in 293T cells. Cell ly sates of vector or pCDN A 3.1 -Myc His 121P1F1 
5 transfected 293T cells were subjected to Western analysis with anti-His polyclonal antibody (Santa Cruz 
Biotechnology). Seen is a 35 kD band representing expression of 121 PI Fl Myc His-tagged protein. 

Figure 15. Androgen regulation of 1 2 1 PI Fl in vivo. Male mice were injected with LAPC-9AD tumor 
cells. When tumor reached a palpable size (0.3-0.5cm in diameter), mice were castrated and tumors harvested at 
different time points following castration. RNA was isolated from the xenograft tissues. Northern blots with 10 ug 
10 of total RN A/lane were probed with the 121P1F1 SSH fragment. Size standards in kilobases (kb) are indicated on 
the side. Results show expression of 121 PI Fl is slightly downregulated 7 days after castration. The protein 
TMPRSS2 was used as a positive control. A picture of the ethidium-bromide staining of the RNA gel is also 
presented (lowest panel). 

Figure 16: Secondary structure prediction for 12 1 PI Fl (Figure 16A^ (SEP ID NO: 3) and variant la 
1 5 (Figure 1 6B) (SEQ ID NO: 5) . The secondary structure of 1 2 1 P 1 F 1 and variant 1 a proteins were predicted using 
the HNN - Hierarchical Neural Network method (Guermeur, 1997, Web URL pbil.ibcp.fr/cgi- 
bin/npsa_automat.pl?page=npsa_nn.html), accessed from the ExPasy molecular biology server located on the 
World Wide Web at ( W e b URL www .expasy.ch/tools/). This method predicts the presence and location of alpha 
helices, extended strands, and random coils from the primary protein sequence. The percent of the protein in a 
20 given secondary structure is also given. 

Figure 17. RT-PCR analysis of 121 PI Fl expression. First strand cDNA was prepared (A) from 8 human 
normal tissues, and (B) from vital pool 1 (VP1: liver, lung and kidney), vital pool 2 (VP2, pancreas, spleen and 
stomach), LAPC xenograft pool (XP; LAPC-4AD, LAPC-4AI, LAPC-9AD and LAPC-9AI), normal prostate (NP), 
prostate cancer pool, bladder cancer pool, kidney cancer pool, colon cancer pool and lung cancer pool. 
25 Normalization was performed by PCR using primers to actin and GAPDH. Semi-quantitative PCR, using primers 
to 121P1F1, was performed at 25 and 30 cycles of amplification. 

Figure 18. Expression of 121 PI Fl in normal human tissues by Northern blot analysis. Two multiple 
tissue northern blots (Clontech) with 2 ug of mRN A/lane, were probed with the 121P1F1 SSH fragment. Size 
standards in kilobases (kb) are indicated on the side. The results show exclusive expression of an approximately 1.2 
30 kb 121P1F1 transcript in testis and to a lower level in thymus. 

Figure 19. Expression of 121 PI Fl in cancer cell lines. RNA was extracted from a number of cancer cell 
lines. Northern blots with 10 ug of total RNA/lane were probed with the 12 1 PI Fl SSH fragment. Size standards in 
kilobases (kb) are indicated on the side. 

Figure 20. Expression of 121 PI Fl in prostate cancer patient samples. RNA was extracted from the 
35 prostate tumors (T) and their normal adjacent tissue (N) derived from prostate cancer patients. Tumors of patients 
1 , 2 and 3 have a Gleason score of 6. Tumors of patients 4, 5 and 6 have a Gleason score of 7. Tumors of patients 
7, 8 and 9 have a Gleason score of 9. Northern blots with 10 ug of total RNA/lane were probed with the 121pl Fl 
SSH fragment. Size standards in kilobases (kb) are indicated on the side. 

Figure 21. Expression of 121 PI Fl in human patient cancer specimens and cancer cell lines. Expression 
40 of 121 PI Fl was assayed in a panel of human cancers (T) and their respective matched normal tissues (N) on RNA 
dot blots. 121P1F1 expression was seen in kidney, breast, cervix, and stomach cancers. 121P1F1 was also found to 
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be highly expressed in a panel of cancer cell lines in the following cancer cell lines; HeLa, Daudi, K562, HL-60, 
G361, A549, MOLT-4, SW480, and Raji. 

Figure 22. Androgen regulation of 121P1F1 in vitro. LAPC-4 2 cells were grown in charcoal-stripped 
medium and stimulated with the synthetic androgen mibolerone, for either 14 or 24 hours. Northern blot was 
5 performed with lOug of total RNA for each sample, and probed with the 12 1 PI Fl SSH fragment. A picture of the 
ethidium-bromide staining of the RNA gel is also presented (lowest panel). Hybridization of the same northern blot 
with the androgen-dependent gene TMPRSS2 confirms the quality of the the androgen deprivation. The results 
show that the expression of 121 PI Fl goes down in absence of normal serum, and is modulated in presence of 
mibolerone, 24 hours after stimulation. 

10 

DETAILED DESCRIPTION OF THE INVENTION 
Outline of Sections 
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XI. ) Diagnostic and Prognostic Embodiments of 121P1F1. 

XII. ) Inhibition of 121P1F1 Protein Function 

XII.A.) Inhibition of 121P1F1 With Intracellular Antibodies 
XII. B.) Inhibition of 121P1F1 with Recombinant Proteins 
5 XII.C.) Inhibition of 121P1F1 Transcription or Translation 

XII.D.) General Considerations for Therapeutic Strategies 

XIII. ) KITS 



L) Definitions: 

10 Unless otherwise defined, all terms of art, notations and other scientific terms or terminology used herein 

are intended to have the meanings commonly understood by those of skill in the art to which this invention pertains. 
In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference, 
and the inclusion of such definitions herein should not necessarily be construed to represent a substantial difference 
over what is generally understood in the art. Many of the techniques and procedures described or referenced herein 

15 are well understood and commonly employed using conventional methodology by those skilled in the art, such as, 
for example, the widely utilized molecular cloning methodologies described in Sambrook et aL, Molecular Cloning: 
A Laboratory Manual 2nd. edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. As 
appropriate, procedures involving the use of commercially available kits and reagents are generally carried out in 
accordance with manufacturer defined protocols and/or parameters unless otherwise noted. 

20 The terms "advanced prostate cancer", "locally advanced prostate cancer", "advanced disease" and 

"locally advanced disease" mean prostate cancers that have extended through the prostate capsule, and are meant to 
include stage C disease under the American Urological Association (AUA) system, stage CI - C2 disease under the 
Whitmore-Jewett system, and stage T3 - T4 and N+ disease under the TNM (tumor, node, metastasis) system. In 
general, surgery is not recommended for patients with locally advanced disease, and these patients have 

25 substantially less favorable outcomes compared to patients having clinically localized (organ-confined) prostate 
cancer. Locally advanced disease is clinically identified by palpable evidence of induration beyond the lateral 
border of the prostate, or asymmetry or induration above the prostate base. Locally advanced prostate cancer is 
presently diagnosed pathologically following radical prostatectomy if the tumor invades or penetrates the prostatic 
capsule, extends into the surgical margin, or invades the seminal vesicles. 

30 "Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more 

carbohydrate moieties found in native sequence 121 PI Fl (either by removing the underlying glycosylation site or 
by deleting the glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites 
that are not present in the native sequence 12 1 PI Fl . In addition, the phrase includes qualitative changes in the 
glycosylation of the native proteins, involving a change in the nature and proportions of the various carbohydrate 

35 moieties present. 

The term "analog" refers to a molecule which is structurally similar or shares similar or corresponding 
attributes with another molecule (e.g. a 1 2 1 P 1 F 1 -related protein). For example an analog of a 1 2 1 P 1 F 1 protein can be 
specifically bound by an antibody or T cell that specifically binds to 121 PI Fl . 

The term "antibody" is used in the broadest sense. Therefore an "antibody" can be naturally occurring or man- 
40 made such as monoclonal antibodies produced by conventional hybridoma technology. Anti-121P1F1 antibodies 

comprise monoclonal and polyclonal antibodies as well as fragments containing the antigen-binding domain and/or one 
or more complementarity determining regions of these antibodies. 
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An "antibody fragment" is defined as at least a portion of the variable region of the immunoglobulin 
molecule that binds to its target, i.e., the antigen-binding region, in one embodiment it specifically covers single anti- 
121P1F1 antibodies and clones thereof (including agonist, antagonist and neutralizing antibodies) and anti-121PlFl 
antibody compositions with polyepitopic specificity. 
5 The term "codon optimized sequences" refers to nucleotide sequences that have been optimized for a 

particular host species by replacing any codons having a usage frequency of less than about 20%. Nucleotide 
sequences that have been optimized for expression in a given host species by elimination of spurious 
polyadenylation sequences, elimination of exon/intron splicing signals, elimination of transposon-like repeats 
and/or optimization of GC content in addition to codon optimization are referred to herein as an "expression 

1 0 enhanced sequences." 

The term "cytotoxic agent" refers to a substance that inhibits or prevents the expression activity of cells, 
function of cells and/or causes destruction of cells. The term is intended to include radioactive isotopes 
chemotherapeutic agents, and toxins such as small molecule toxins or enzymatically active toxins of bacterial, 
fungal, plant or animal origin, including fragments and/or variants thereof. Examples of cytotoxic agents include, 

1 5 but are not limited to maytansinoids, yttrium, bismuth, ricin, ricin A-chain, doxorubicin, daunorubicin, taxol, 
ethidium bromide, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicine, dihydroxy anthracin 
dione, actinomycin, diphtheria toxin, Pseudomonas exotoxin (PE) A, PE40, abrin, abrin A chain, modeccin A chain, 
alpha-sarcin, gelonin, mitogellin, retstrictocin, phenomycin, enomycin, curicin, crotin, calicheamicin, sapaonaria 
officinalis inhibitor, and glucocorticoid and other chemotherapeutic agents, as well as radioisotopes such as At 211 , 

20 I 131 , I 125 , Y 90 , Re 186 , Re 188 , Sm 153 , Bi 212 , P 32 and radioactive isotopes of Lu. Antibodies may also be conjugated to an 
anti-cancer pro-drug activating enzyme capable of converting the pro-drug to its active form. 

The term "homolog" refers to a molecule which exhibits homology to another molecule, by for example, 
having sequences of chemical residues that are the same or similar at corresponding positions. 

"Human Leukocyte Antigen" or "HLA" is a human class 1 or class II Major Histocompatibility Complex 

25 (MHC) protein (see, e.g., Stites, et at., Immunology, 8 th Ed., Lange Publishing, Los Altos, CA (1994). 

The terms "hybridize", "hybridizing", "hybridizes" and the like, used in the context of polynucleotides, are 
meant to refer to conventional hybridization conditions, preferably such as hybridization in 50% 
formamide/6XSSC/0.1% SDS/100 ug/ml ssDNA, in which temperatures for hybridization are above 37 degrees C 
and temperatures for washing in 0.1XSSC/0.1% SDS are above 55 degrees C. 

30 The phrases "isolated" or "biologically pure" refer to material which is substantially or essentially free 

from components which normally accompany the material as it is found in its native state. Thus, isolated peptides 
in accordance with the invention preferably do not contain materials normally associated with the peptides in their 
in situ environment. For example, a polynucleotide is said to be "isolated" when it is substantially separated from 
contaminant polynucleotides that correspond or are complementary to genes other than the 12 1 PI F 1 genes or that 

35 encode polypeptides other than 121P1F1 gene product or fragments thereof. A skilled artisan can readily employ 

nucleic acid isolation procedures to obtain an isolated 121P1F1 polynucleotide. A protein is said to be "isolated," for 
example, when physical, mechanical or chemical methods are employed to remove the 121 PI Fl proteins from cellular 
constituents that are normally associated with the protein. A skilled artisan can readily employ standard purification 
methods to obtain an isolated 121 PI Fl protein. Alternatively, an isolated protein can be prepared by chemical means. 

40 The term "mammal" refers to any organism classified as a mammal, including mice, rats, rabbits, dogs, cats, 

cows, horses and humans. In one embodiment of the invention, the mammal is a mouse. In another embodiment of the 
invention, the mammal is a human. 
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The terms "metastatic prostate cancer" and "metastatic disease" mean prostate cancers that have spread to 
regional lymph nodes or to distant sites, and are meant to include stage D disease under the AU A system and stage 
TxNxM+ under the TOM system. As is the case with locally advanced prostate cancer, surgery is generally not 
indicated for patients with metastatic disease, and hormonal (androgen ablation) therapy is a preferred treatment 
5 modality. Patients with metastatic prostate cancer eventually develop an androgen-refractory state within 1 2 to 18 
months of treatment initiation. Approximately half of these androgen-refractory patients die within 6 months after 
developing that status. The most common site for prostate cancer metastasis is bone. Prostate cancer bone 
metastases are often osteoblastic rather than osteolytic (i.e., resulting in net bone formation). Bone metastases are 
found most frequently in the spine, followed by the femur, pelvis, rib cage, skull and humerus. Other common sites 
10 for metastasis include lymph nodes, lung, liver and brain. Metastatic prostate cancer is typically diagnosed by open 
or laparoscopic pelvic lymphadenectomy, whole body radionuclide scans, skeletal radiography, and/or bone lesion 
biopsy. 

The term "monoclonal antibody" refers to an antibody obtained from a population of substantially 
homogeneous antibodies, i.e., the antibodies comprising the population are identical except for possible naturally 

1 5 occurring mutations that are present in minor amounts. 

A "motif, as in biological motif of an 121PlFl-related protein, refers to any pattern of amino acids 
forming part of the primary sequence of a protein, that is associated with a particular function (e.g. protein-protein 
interaction, protein-DNA interaction, etc) or modification (e.g. that is phosphorylated, glycosylated or amidated), or 
localization (e.g. secretory sequence, nuclear localization sequence, etc.) or a sequence that is correlated with being 

20 immunogenic, either humorally or cellularly. A motif can be either contiguous or capable of being aligned to 
certain positions that are generally correlated with a certain function or property. In the context of HLA motifs, 
"motif refers to the pattern of residues in a peptide of defined length, usually a peptide of from about 8 to about 1 3 
amino acids for a class I HLA motif and from about 6 to about 25 amino acids for a class II HLA motif, which is 
recognized by a particular HLA molecule. Peptide motifs for HLA binding are typically different for each protein 

25 encoded by each human HLA allele and differ in the pattern of the primary and secondary anchor residues. 

A "pharmaceutical excipienf comprises a material such as an adjuvant, a carrier, pH-adjusting and 
buffering agents, tonicity adjusting agents, wetting agents, preservative, and the like. 

"Pharmaceutically acceptable" refers to a non-toxic, inert, and/or composition that is physiologically 
compatible with humans or other mammals. 

30 The term "polynucleotide" means a polymeric form of nucleotides of at least 10 bases or base pairs in 

length, either ribonucleotides or deoxynucleotides or a modified form of either type of nucleotide, and is meant to 
include single and double stranded forms of DNA and/or RNA. In the art, this term if often used interchangeably 
with "oligonucleotide". A polynucleotide can comprise a nucleotide sequence disclosed herein wherein thymidine 
(T), as shown for example in Figure 2, can also be uracil (U); this definition pertains to the differences between the 

35 chemical structures of DNA and RNA, in particular the observation that one of the four major bases in RNA is 
uracil (U) instead of thymidine (T). 

The term "polypeptide" means a polymer of at least about 4, 5, 6, 7, or 8 amino acids. Throughout the 
specification, standard three letter or single letter designations for amino acids are used. In the art, this term is often 
used interchangeably with "peptide" or "protein". 

40 An HLA "primary anchor residue" is an amino acid at a specific position along a peptide sequence which 

is understood to provide a contact point between the immunogenic peptide and the HLA molecule. One to three, 
usually two, primary anchor residues within a peptide of defined length generally defines a "motif for an 
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immunogenic peptide. These residues are understood to fit in close contact with peptide binding groove of an HLA 
molecule, with their side chains buried in specific pockets of the binding groove. In one embodiment, for example, 
the primary anchor residues for an HLA class I molecule are located at position 2 (from the amino terminal 
position) and at the carboxyl terminal position of a 8, 9, 10, 1 1, or 12 residue peptide epitope in accordance with the 
5 invention. In another embodiment, for example, the primary anchor residues of a peptide that will bind an HLA 
class II molecule are spaced relative to each other, rather than to the termini of a peptide, where the peptide is 
generally of at least 9 amino acids in length. The primary anchor positions for each motif and supermotif are set 
forth in Table IV. For example, analog peptides can be created by altering the presence or absence of particular 
residues in the primary and/or secondary anchor positions shown in Table IV. Such analogs are used to modulate 
10 the binding affinity and/or population coverage of a peptide comprising a particular HLA motif or supermotif. 

A "recombinant" DNA or RNA molecule is a DNA or RNA molecule that has been subjected to molecular 
manipulation in vitro. 

Non-limiting examples of small molecules include compounds that bind or interact with 121P1F1, ligands 
including hormones, neuropeptides, chemokines, odorants, phospholipids, and functional equivalents thereof that 

15 bind and preferably inhibit 121P1F1 protein function. Such non-limiting small molecules preferably have a 

molecular weight of less than about 10 kDa, more preferably below about 9, about 8, about 7, about 6, about 5 or 
about 4 kDa. In certain embodiments, small molecules physically associate with, or bind, 121P1F1 protein; are not 
found in naturally occurring metabolic pathways; and/or are more soluble in aqueous than non-aqueous solutions 
"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 

20 generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. In 
general, longer probes require higher temperatures for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denatured nucleic acid sequences to reanneal when 
complementary strands are present in an environment below their melting temperature. The higher the degree of 
desired homology between the probe and hybridizable sequence, the higher the relative temperature that can be 

25 used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more 
stringent, while lower temperatures less so. For additional details and explanation of stringency of hybridization 
reactions, see Ausubel et al. y Current Protocols in Molecular Biology, Wiley Interscience Publishers, (1995). 

"Stringent conditions" or "high stringency conditions", as defined herein, are identified by, but not limited 
to, those that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 

30 chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a 

denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0. 1% bovine serum albumin/0. 1% 
Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 
mM sodium citrate at 42 °C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCI, 0.075 M sodium citrate), 50 
mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm 

35 DNA (50 ug/ml), 0.1% SDS, and 10% dextran sulfate at 42 °C, with washes at 42°C in 0.2 x SSC (sodium 

chloride/sodium, citrate) and 50% formamide at 55 °C, followed by a high-stringency wash consisting of 0.1 x SSC 
containing EDTA at 55 °C. "Moderately stringent conditions" are described by, but not limited to, those in 
Sambrook et aL, Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989, and 
include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less 

40 stringent than those described above. An example of moderately stringent conditions is overnight incubation at 

37°C in a solution comprising: 20% formamide, 5 x SSC (150 mM NaCI, 15 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/mL denatured sheared salmon sperm 
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DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

An HLA "supermotif ' is a peptide binding specificity shared by HLA molecules encoded by two or more 
HLA alleles. 

5 As used herein "to treat" or "therapeutic" and grammatically related terms, refer to any improvement of 

any consequence of disease, such as prolonged survival, less morbidity, and/or a lessening of side effects which are 
the byproducts of an alternative therapeutic modality; full eradication of disease is not required. 

A "transgenic animal" (e.g., a mouse or rat) is an animal having cells that contain a transgene, which 
transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic stage. A 

10 "transgene" is a DNA that is integrated into the genome of a cell from which a transgenic animal develops. 

As used herein, an HLA or cellular immune response "vaccine" is a composition that contains or encodes 
one or more peptides of the invention: There are numerous embodiments of such vaccines, such as a cocktail of one 
or more individual peptides; one or more peptides of the invention comprised by a polyepitopic peptide; or nucleic 
acids that encode such individual peptides or polypeptides, e.g., a minigene that encodes a polyepitopic peptide. 

15 The "one or more peptides" can include any whole unit integer from 1-150 or more, e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22, 23,24,25,26,27,28,29, 30,31,32, 33,34,35, 36,37,38,39,40, 
41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 
145, or 150 or more peptides of the invention. The peptides or polypeptides can optionally be modified, such as by 
lipidation, addition of targeting or other sequences. HLA class I peptides of the invention can be admixed with, or 

20 linked to, HLA class II peptides, to facilitate activation of both cytotoxic T lymphocytes and helper T lymphocytes. 
HLA vaccines can also comprise peptide-pulsed antigen presenting cells, e.g., dendritic cells. 

The term "variant" refers to a molecule that exhibits a variation from a described type or norm, such as a 
protein that has one or more different amino acid residues in the corresponding position(s) of a specifically described 
protein (e.g. the 121P1F1 protein shown in Figure 2 or Figure 3. An analog is an example of a variant protein. Splice 

25 isoforms and single nucleotides polymorphisms (SNPs) are further examples of variants. 

The "121PlFl-related proteins" of the invention include those specifically identified herein, as well as allelic 
variants, conservative substitution variants, analogs and homologs that can be isolated/generated and characterized 
without undue experimentation following the methods outlined herein or readily available in the art Fusion proteins 
that combine parts of different 121P1F1 proteins or fragments thereof, as well as fusion proteins of a 121P1F1 protein 

30 and a heterologous polypeptide are also included. Such 121P1F1 proteins are collectively referred to as the 121P1F1- 
related proteins, the proteins of the invention, or 1 2 1 P 1 F 1 . The term " 1 2 1 P 1 F 1 -related protein" refers to a polypeptide 
fragment or an 121P1F1 protein sequence of 4, 5,6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24,25, 
or more than 25 amino acids; or, at least 30, 35, 40, 45, 50, 55, 60, 65, 70, 80, 85, 90, 95, 100 or more than 100 
amino acids. 

35 1U 121P1F1 Polynucleotides 

One aspect of the invention provides polynucleotides corresponding or complementary to all or part of an 
121P1F1 gene, mRNA, and/or coding sequence, preferably in isolated form, including polynucleotides encoding an 
121PlFl-related protein and fragments thereof, DNA, RNA, DNA/RNA hybrid, and related molecules, 
polynucleotides or oligonucleotides complementary to an 121 PI Fl gene or mRNA sequence or a part thereof, and 
40 polynucleotides or oligonucleotides that hybridize to an 121P1F1 gene, mRNA, or to an 121 PI Fl encoding 

polynucleotide (collectively, "121P1F1 polynucleotides"). In all instances when referred to in this section, T can 
also be U in Figure 2. 
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Embodiments of a 121 PI Fl polynucleotide include: a 121P1F1 polynucleotide having the sequence shown 
in Figure 2, the nucleotide sequence of 121 PI Fl as shown in Figure 2 wherein T is U; at least 10 contiguous 
nucleotides of a polynucleotide having the sequence as shown in Figure 2; or, at least 10 contiguous nucleotides of 
a polynucleotide having the sequence as shown in Figure 2 where T is U. For example, embodiments of 12 1 PI Fl 
5 nucleotides comprise, without limitation: 

(I) a polynucleotide comprising, consisting essentially of, or consisting of a sequence as shown in 
Figure 2 (SEQ1DNQ: ) , wherein T can also be U; 

(II) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
10 Figure 2A (SEQ1DNQ: — ) , from nucleotide residue number 82 through nucleotide residue number 

696, followed by a stop codon, wherein T can also be U; 

(III) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
Figure 2B (SEQ1DNQ: from nucleotide residue number 82 through nucleotide residue number 
459, followed by a stop codon, wherein T can also be U; 

1 5 (IV) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 

Figure 2C (SEQ1DNQ: — ) , from nucleotide residue number 501 through nucleotide residue number 
857, followed by a stop codon, wherein T can also be U; 

(V) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
Figure 2D (SEQ1DNQ: from nucleotide residue number 82 through nucleotide residue number 

20 447, followed by a stop codon, wherein T can also be U; 

(VI) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
Figure 2E (SEQ ID NO: ) , from nucleotide residue number 82 through nucleotide residue number 
65 1 , followed by a stop codon, wherein T can also be U; 

(VII) a polynucleotide comprising, consisting essentially of, or consisting of the sequence as shown in 
25 Figure 2F (SEQ I D NO: ^ from nucleotide residue number 281 through nucleotide residue number 

850, followed by a stop codon, wherein T can also be U; 

(VIII) a polynucleotide that encodes an 121PlFl-related protein that is at least 90% homologous to an 
entire amino acid sequence shown in Figure 2A-F (SEQ ID NO: ) ; 

(IX) a polynucleotide that encodes an 121 PI Fl -related protein that is at least 90% identical to an entire 
30 amino acid sequence shown in Figure 2A-F (SEQ ID NO: ) ; 

(X) a polynucleotide that encodes at least one peptide set forth in Tables V-XVIII, XXVI, and XXVII; 

(XI) a polynucleotide that encodes a peptide region of at least 5 amino acids of a peptide of Figure 3 A 
in any whole number increment up to 205 that includes an amino acid position having a value greater than 
0.5 in the Hydrophilicity profile of Figure 5A, or of Figure 3B in any whole number increment up to 126 
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that includes an amino acid position having a value greater than 0.5 in the Hydrophilicity profile of Figure 
5B; 

(XII) a polynucleotide that encodes a peptide region of at least 5 amino acids of a peptide of Figure 3 A 
in any whole number increment up to 205 that includes an amino acid position having a value less than 0.5 
in the Hydropathicity profile of Figure 6A, or of Figure 3B in any whole number increment up to 126, that 
includes an amino acid position having a value less than 0.5 in the Hydropathicity profile of Figure 6B; 

(XIII) a polynucleotide that encodes a peptide region of at least 5 amino acids of a peptide of Figure 3 A 
in any whole number increment up to 205 that includes an amino acid position having a value greater than 
0.5 in the Percent Accessible Residues profile of Figure 7 A, or of Figure 3B in any whole number 
increment up to 126, that includes an amino acid position having a value greater than 0.5 in the Percent 
Accessible Residues profile of Figure 7B; 

(XIV) a polynucleotide that encodes a peptide region of at least 5 amino acids of a peptide of Figure 3 A 
in any whole number increment up to 205 that includes an amino acid position having a value greater than 
0.5 in the Average Flexibility profile on Figure 8A, or of Figure 3B in any whole number increment up to 
126, that includes an amino acid position having a value greater than 0.5 in the Average Flexibility profile 
on Figure 8B; 

(XV) a polynucleotide that encodes a peptide region of at least 5 amino acids of a peptide of Figure 3 A 
in any whole number increment up to 205 that includes an amino acid position having a value greater than 
0.5 in the Beta-turn profile of Figure 9 A, or of Figure 3B in any whole number increment up to 126, that 
includes an amino acid position having a value greater than 0.5 in the Beta-turn profile of Figure 9B; 

(XVI) a polynucleotide that encodes a 1 2 1 P 1 F 1 -related protein whose sequence is encoded by the 
cDNAs contained in the plasmid deposited with American Type Culture Collection as Accession No. PTA- 
3139 on Mar 1,2001; 

(XVII) a polynucleotide that is fully complementary to a polynucleotide of any one of (I)-(XVI); 

(XVIII) a polynucleotide that selectively hybridizes under stringent conditions to a polynucleotide of (I)- 
(XVII); 

(XIX) a peptide that is encoded by any of (IHXVIII); and, 

(XX) a polynucleotide of any of (I)-(XVIII)or peptide of (XIX) together with a pharmaceutical 
excipient and/or in a human unit dose form. 

As used herein, a range is understood to specifically disclose all whole unit positions thereof. 

Typical embodiments of the invention disclosed herein include 121P1F1 polynucleotides that encode 
specific portions of 12 1 PI Fl mRNA sequences (and those which are complementary to such sequences) such as 
those that encode the proteins and/or fragments thereof, for example: 
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(a) 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24,25,30,35,40,45,50,55,60, 
65,70, 75, 80, 85,90,95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 
190, 195, 200, or 205 contiguous amino acids of 121 PI Fl; 

(b) 4, 5,6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22, 23, 24, 25,30,35,40,45, 50,55,60, 
5 65, 70, 75, 80, 85, 90, 95, 100, 105, 1 10, 115, 120, 125, or 126 contiguous amino acids of variant 1 A; 

(c) 4,5,6, 7,8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22, 23,24,25,30,35,40,45, 50,55,60, 
65, 70, 75, 80, 85, 90, 95, 100, 105, 1 10, 1 15, or 1 19 contiguous amino acids of variant IB; 

(d) 4, 5,6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24,25, 30,35,40,45, 50, 55,60, 
65, 70, 75, 80, 85, 90, 95, 100, 105, 1 10, 1 15, 120, or 122 contiguous amino acids of variant 2; or, 

10 (e) 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23,24,25,30,35,40,45,50,55,60, 

65, 70, 75, 80, 85,90,95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 
or 190 contiguous amino acids of variant 3; or, 

(f)4, 5,6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21,22,23, 24,25,30,35, 40,45, 50,55,60, 
65, 70, 75, 80, 85,90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 

15 or 1 90 contiguous amino acids of variant 4. 

For example, representative embodiments of the invention disclosed herein include: polynucleotides and 
their encoded peptides themselves encoding about amino acid 1 to about amino acid 10 of the 121P1F1 protein or 
variants shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 10 to about amino acid 20 of the 
121P1F1 protein or variants shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 20 to about 

20 amino acid 30 of the 121P1F1 protein or variants shown in Figure 2 or Figure 3, polynucleotides encoding about 
amino acid 30 to about amino acid 40 of the 121 PI Fl protein or variants shown in Figure 2 or Figure 3, 
polynucleotides encoding about amino acid 40 to about amino acid 50 of the 121 PI Fl protein or variants shown in 
Figure 2 or Figure 3, polynucleotides encoding about amino acid 50 to about amino acid 60 of the 121P1F1 protein 
or variants shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 60 to about amino acid 70 of 

25 the 121 PI Fl protein or variants shown in Figure 2 or Figure 3, polynucleotides encoding about amino acid 70 to 
about amino acid 80 of the 121P1F1 protein or variants shown in Figure 2 or Figure 3, polynucleotides encoding 
about amino acid 80 to about amino acid 90 of the 121P1F1 protein or variants shown in Figure 2 or Figure 3, 
polynucleotides encoding about amino acid 90 to about amino acid 100 of the 121P1F1 protein or variants shown in 
Figure 2 or Figure 3, in increments of about 10 amino acids, ending at the carboxyl terminal amino acid set forth in 

30 Figure 2 or Figure 3. Accordingly polynucleotides encoding portions of the amino acid sequence (of about 10 
amino acids), of amino acids 100 through the carboxyl terminal amino acid of the 121P1F1 protein are 
embodiments of the invention. Wherein it is understood that each particular amino acid position discloses that 
position plus or minus five amino acid residues. 

Polynucleotides encoding relatively long portions of a 121 PI Fl protein are also within the scope of the 

35 invention. For example, polynucleotides encoding from about amino acid 1 (or 20 or 30 or 40 etc.) to about amino 
acid 20, (or 30, or 40 or 50 etc.) of the 121 PI Fl protein or variants shown in Figured or Figure 3 can be generated 
by a variety of techniques well known in the art. These polynucleotide fragments can include any portion of the 
121P1F1 sequence or variants as shown in Figure 2. 

Additional illustrative embodiments of the invention disclosed herein include 121P1F1 polynucleotide 

40 fragments encoding one or more of the biological motifs contained within a 1 2 1 P 1 F 1 protein sequence or variant 
sequence, including one or more of the motif-bearing subsequences of a 1 2 1 P 1 F 1 protein or variant set forth in 
Tables V-XVIII, XXVI, and XXVII. In another embodiment, typical polynucleotide fragments of the invention 
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encode one or more of the regions of 121P1F1 protein or variant that exhibit homology to a known molecule. In 
another embodiment of the invention, typical polynucleotide fragments can encode one or more of the 121P1F1 
protein or variant N-glycosylation sites, cAMP and cGMP-dependent protein kinase phosphorylation sites, casein 
kinase II phosphorylation sites or N-myristoylation site and amidation sites. 

5 

II. A.) Uses of 121P1F1 Polynucleotides 
H.A.I.) Monitoring of Genetic Abnormalities 

The polynucleotides of the preceding paragraphs have a number of different specific uses. The human 
121P1F1 gene maps to the chromosomal location set forth in Example 3. For example, because the 121P1F1 gene 

10 maps to this chromosome, polynucleotides that encode different regions of the 121P1F1 proteins are used to 

characterize cytogenetic abnormalities of this chromosomal locale, such as abnormalities that are identified as being 
associated with various cancers. In certain genes, a variety of chromosomal abnormalities including rearrangements 
have been identified as frequent cytogenetic abnormalities in a number of different cancers (see e.g. Krajinovic et 
aL, Mutat. Res. 382(3-4): 81-83 (1998); Johansson et ai, Blood 86(10): 3905-3914 (1995) and Finger et aL, 

15 P.N.A.S. 85(23): 9158-9162 (1988)). Thus, polynucleotides encoding specific regions of the 121 PI Fl proteins 
provide new tools that can be used to delineate, with greater precision than previously possible, cytogenetic 
abnormalities in the chromosomal region that encodes 12 1 PI Fl that may contribute to the malignant phenotype. In 
this context, these polynucleotides satisfy a need in the art for expanding the sensitivity of chromosomal screening 
in order to identify more subtle and less common chromosomal abnormalities (see e.g. Evans et ai y Am. J. Obstet. 

20 Gynecol 171(4): 1055-1057 (1994)). 

Furthermore, as 121P1FI was shown to be highly expressed in bladder and other cancers, 121P1F1 
polynucleotides are used in methods assessing the status of 121 PI Fl gene products in normal versus cancerous 
tissues. Typically, polynucleotides that encode specific regions of the 121P1F1 proteins are used to assess the 
presence of perturbations (such as deletions, insertions, point mutations, or alterations resulting in a loss of an 

25 antigen etc.) in specific regions of the 121 PI Fl gene, such as regions containing one or more motifs. Exemplary 

assays include both RT-PCR assays as well as single-strand conformation polymorphism (SSCP) analysis (see, e.g., 
Marrogi et aL, J. Cutan. Pathol. 26(8): 369-378 (1999), both of which utilize polynucleotides encoding specific 
regions of a protein to examine these regions within the protein. 
H.A.2.) Antisense Embodiments 

30 Other specifically contemplated nucleic acid related embodiments of the invention disclosed herein are 

genomic DNA, cDNAs, ribozymes, and antisense molecules, as well as nucleic acid molecules based on an alternative 
backbone, or including alternative bases, whether derived from natural sources or synthesized, and include molecules 
capable of inhibiting the RNA or protein expression of 1 2 1 P 1 F 1 . For example, antisense molecules can be RN As or 
other molecules, including peptide nucleic acids (PNAs) or non-nucleic acid molecules such as phosphorothioate 

35 derivatives, that specifically bind DNA or RNA in a base pair-dependent manner. A skilled artisan can readily obtain 
these classes of nucleic acid molecules using the 1 2 1 P 1 F 1 polynucleotides and polynucleotide sequences disclosed 
herein. 

Antisense technology entails the administration of exogenous oligonucleotides that bind to a target 
polynucleotide located within the cells. The term "antisense" refers to the fact that such oligonucleotides are 
40 complementary to their intracellular targets, e.g., 1 2 1 P 1 F 1 . See for example, Jack Cohen, Oligodeoxynucleotides, 
Antisense Inhibitors of Gene Expression, CRC Press, 1989; and Synthesis 1:1-5 (1988). The 12 1 PI Fl antisense 
oligonucleotides of the present invention include derivatives such as S-oligonucleotides (phosphorothioate 
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derivatives or S-oligos, see, Jack Cohen, supra), which exhibit enhanced cancer cell growth inhibitory action. S- 
oligos (nucleoside phosphorothioates) are isoelectronic analogs of an oligonucleotide (O-oligo) in which a 
nonbridging oxygen atom of the phosphate group is replaced by a sulfur atom. The S-oligos of the present 
invention can be prepared by treatment of the corresponding O-oligos with 3 H-l,2-benzodithiol-3-one- 1,1 -dioxide, 
which is a sulfur transfer reagent. See, e.g., Iyer, R. P. et at., J. Org. Chem. 55:4693-4698 (1990); and Iyer, R. P. et 
aL y J. Am. Chem. Soc. 1 12:1253-1254 (1990). Additional 121P1F1 antisense oligonucleotides of the present 
invention include morpholino antisense oligonucleotides known in the art (see, e.g., Partridge et al y 1996, 
Antisense & Nucleic Acid Drug Development 6: 169-175). 

The 121P1F1 antisense oligonucleotides of the present invention typically can be RNA or DNA that is 
complementary to and stably hybridizes with the first 100 5' codons or last 100 3' codons of a 121 PI Fl genomic 
sequence or the corresponding mRNA. Absolute complementarity is not required, although high degrees of 
complementarity are preferred. Use of an oligonucleotide complementary to this region allows for the selective 
hybridization to 121P1F1 mRNA and not to mRNA specifying other regulatory subunits of protein kinase. In one 
embodiment, 121P1F1 antisense oligonucleotides of the present invention are 15 to 30-mer fragments of the 
antisense DNA molecule that have a sequence that hybridizes to 121 PI Fl mRNA. Optionally, 121P1F1 antisense 
oligonucleotide is a 30-mer oligonucleotide that is complementary to a region in the first 10 5' codons or last 10 3' 
codons of 121 PI Fl. Alternatively, the antisense molecules are modified to employ ribozymes in the inhibition of 
121P1F1 expression, see, e.g., L. A. Couture & D. T. Stinchcomb; Trends Genet 12: 510-515 (1996). 

II.A.3/1 Primers and Primer Pairs 

Further specific embodiments of this nucleotides of the invention include primers and primer pairs, which 
allow the specific amplification of polynucleotides of the invention or of any specific parts thereof, and probes that 
selectively or specifically hybridize to nucleic acid molecules of the invention or to any part thereof. Probes can be 
labeled with a detectable marker, such as, for example, a radioisotope, fluorescent compound, bioluminescent 
compound, a chemiluminescent compound, metal chelator or enzyme. Such probes and primers are used to detect 
the presence of a 1 2 1 P 1 F 1 polynucleotide in a sample and as a means for detecting a cell expressing a 1 2 1 P 1 F 1 protein. 

Examples of such probes include polypeptides comprising all or part of the human 121P1F1 cDNA sequence 
shown in Figure 2. Examples of primer pairs capable of specifically amplifying 121P1F1 mRNAs are also described in 
the Examples. As will be understood by the skilled artisan, a great many different primers and probes can be prepared 
based on the sequences provided herein and used effectively to amplify and/or detect a 1 2 1 P 1 F 1 mRNA. 

The 121P1F1 polynucleotides of the invention are useful for a variety of purposes, including but not 
limited to their use as probes and primers for the amplification and/or detection of the 121 PI Fl gene(s), mRNA(s), 
or fragments thereof; as reagents for the diagnosis and/or prognosis of prostate cancer and other cancers; as coding 
sequences capable of directing the expression of 121 PI Fl polypeptides; as tools for modulating or inhibiting the 
expression of the 121 PI Fl gene(s) and/or translation of the 121 PI Fl transcript(s); and as therapeutic agents. 

The present invention includes the use of any probe as described herein to identify and isolate a 121 PI Fl or 
12 1 P 1 F 1 related nucleic acid sequence from a naturally occurring source, such as humans or other mammals, as well as 
the isolated nucleic acid sequence per se, which would comprise all or most of the sequences found in the probe used. 

I1.A>4^ Isolation of 121PlFl-Encoding Nucl eic Acid Molecules 

The 121P1F1 cDNA sequences described herein enable the isolation of other polynucleotides encoding 
121P1F1 gene produces), as well as the isolation of polynucleotides encoding 121P1F1 gene product homologs, 
alternatively spliced isoforms, allelic variants, and mutant forms of a 121P1F1 gene product as well as polynucleotides 
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that encode analogs of 121 PI Fl -related proteins. Various molecular cloning methods that can be employed to isolate 
full length cDNAs encoding an 121P1 Fl gene are well known (see, for example, Sambrook, J. et al. y Molecular 
Cloning: A Laboratory Manual, 2d edition, Cold Spring Harbor Press, New York, 1989; Current Protocols in Molecular 
Biology. Ausubel et al. y Eds., Wiley and Sons, 1995). For example, lambda phage cloning methodologies can be 
5 conveniently employed, using commercially available cloning systems (e.g., Lambda ZAP Express, Stratagene). Phage 
clones containing 121P1F1 gene cDNAs can be identified by probing with a labeled 121P1F1 cDNA or a fragment 
thereof For example, in one embodiment, a 121 PI Fl cDNA (e.g., Figure 2) or a portion thereof can be synthesized and 
used as a probe to retrieve overlapping and full-length cDNAs corresponding to a 121 PI Fl gene. A 121 PI Fl gene itself 
can be isolated by screening genomic DNA libraries, bacterial artificial chromosome libraries (BACs), yeast artificial 
10 chromosome libraries (YACs), and the like, with 121P1F1 DNA probes or primers. 

11.A.5/) Recombinant Nucleic Acid Molecules and Host- Vector Systems 

The invention also provides recombinant DNA or RNA molecules containing an 121 PI Fl polynucleotide, a 
fragment, analog or homologue thereof, including but not limited to phages, plasmids, phagemids, cosmids, YACs, 
BACs, as well as various viral and non-viral vectors well known in the art, and cells transformed or transfected with 

1 5 such recombinant DNA or RNA molecules. Methods for generating such molecules are well known (see, for example, 
Sambrook et ai., 1989, supra). 

The invention further provides a host-vector system comprising a recombinant DNA molecule containing a 
121P1F1 polynucleotide, fragment, analog or homologue thereof within a suitable prokaryotic or eukaryotic host cell. 
Examples of suitable eukaryotic host cells include a yeast cell, a plant cell, or an animal cell, such as a mammalian 

20 cell or an insect cell (e.g., a baculovirus-infectible cell such as an Sf9 or HighFive cell). Examples of suitable 
mammalian cells include various prostate cancer cell lines such as DU145 and TsuPrl, other transfectable or 
transducible prostate cancer cell lines, primary cells (PrEC), as well as a number of mammalian cells routinely used 
for the expression of recombinant proteins (e.g., COS, CHO, 293, 293T cells). More particularly, a polynucleotide 
comprising the coding sequence of 1 2 1 P 1 F 1 or a fragment, analog or homolog thereof can be used to generate 1 21 PI Fl 

25 proteins or fragments thereof using any number of host- vector systems routinely used and widely known in the art. 

A wide range of host- vector systems suitable for the expression of 121P1F1 proteins or fragments thereof are 
available, see for example, Sambrook et aL, 1989, supra; Current Protocols in Molecular Biology, 1995, supra). 
Preferred vectors for mammalian expression include but are not limited to pcDNA 3.1 myc-His-tag (Invitrogen) and 
the retroviral vector pSRatkneo (Muller et aL, 1991, MCB 1 1:1785). Using these expression vectors, 121P1F1 can 

30 be expressed in several prostate cancer and non-prostate cell lines, including for example 293, 293T, rat-1, N1H 3T3 
and TsuPrl. The host- vector systems of the invention are useful for the production of a 121 PI Fl protein or 
fragment thereof. Such host- vector systems can be employed to study the functional properties of 121 PI Fl and 
1 2 1 P 1 F 1 mutations or analogs. 

Recombinant human 121 P1F1 protein or an analog or homolog or fragment thereof can be produced by 

35 mammalian cells transfected with a construct encoding a 121 PI Fl -related nucleotide. For example, 293T cells can 
be transfected with an expression plasmid encoding 121 PI Fl or fragment, analog or homolog thereof, a 121 PI Fi- 
liated protein is expressed in the 293T cells, and the recombinant 121P1F1 protein is isolated using standard 
purification methods (e.g., affinity purification using anti-121PlFl antibodies). In another embodiment, a 121 PI Fl 
coding sequence is subcloned into the retroviral vector pSRctMSVtkneo and used to infect various mammalian cell 

40 lines, such as NIH3T3, TsuPrl, 293 and rat-1 in order to establish 121P1F1 expressing cell lines. Various other 
expression systems well known in the art can also be employed. Expression constructs encoding a leader peptide 
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joined in frame to a 121 PI Fl coding sequence can be used for the generation of a secreted form of recombinant 
121P1F1 protein. 

As discussed herein, redundancy in the genetic code permits variation in 121 PI Fl gene sequences. In 
particular, it is known in the art that specific host species often have specific codon preferences, and thus one can 
adapt the disclosed sequence as preferred for a desired host. For example, preferred analog codon sequences 
typically have rare codons (i.e., codons having a usage frequency of less than about 20% in known sequences of the 
desired host) replaced with higher frequency codons. Codon preferences for a specific species are calculated, for 
example, by utilizing codon usage tables available on the INTERNET such as at URL that is located on the World 
Wide Web at (www.dna.affrc.go.ip/~nakamura/codon.html). 

Additional sequence modifications are known to enhance protein expression in a cellular host. These 
include elimination of sequences encoding spurious polyadenylation signals, exon/intron splice site signals, 
transposon-like repeats, and/or other such we 11 -characterized sequences that are deleterious to gene expression. The 
GC content of the sequence is adjusted to levels average for a given cellular host, as calculated by reference to 
known genes expressed in the host cell. Where possible, the sequence is modified to avoid predicted hairpin 
secondary mRNA structures. Other useful modifications include the addition of a translational initiation consensus 
sequence at the start of the open reading frame, as described in Kozak, Mol. Cell Biol, 9:5073-5080 (1989). Skilled 
artisans understand that the general rule that eukaryotic ribosomes initiate translation exclusively at the 5' proximal 
AUG codon is abrogated only under rare conditions (see, e.g., Kozak PNAS 92(7): 2662-2666, (1995) and Kozak 
NAR 15(20): 8125-8148 (1987)). 

11U 121PlFl-related Proteins 

Another aspect of the present invention provides 121 PI Fl -related proteins. Specific embodiments of 
121P1F1 proteins comprise a polypeptide having all or part of the amino acid sequence of human 121 PI Fl as 
shown in Figure 2 or Figure 3. Alternatively, embodiments of 121P1F1 proteins comprise variant, homolog or 
analog polypeptides that have alterations in the amino acid sequence of 121P1F1 shown in Figure 2 or Figure 3. 

In general, naturally occurring allelic variants of human 121 PI Fl share a high degree of structural identity and 
homology (e.g., 90% or more homology). Typically, allelic variants of a 121P1F1 protein contain conservative amino 
acid substitutions within the 12 1P1 Fl sequences described herein or contain a substitution of an amino acid from a 
corresponding position in a homoiogue of 121P1F1. One class of 121P1F1 allelic variants are proteins that share a high 
degree of homology with at least a small region of a particular 121P1F1 amino acid sequence, but ftirther contain a 
radical departure from the sequence, such as a non-conservative substitution, truncation, insertion or frame shift. In 
comparisons of protein sequences, the terms, similarity, identity, and homology each have a distinct meaning as 
appreciated in the field of genetics. Moreover, orthology and paralogy can be important concepts describing the 
relationship of members of a given protein family in one organism to the members of the same family in other 
organisms. 

Amino acid abbreviations are provided in Table II. Conservative amino acid substitutions can frequently 
be made in a protein without altering either the conformation or the function of the protein. Proteins of the 
invention can comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15 conservative substitutions. Such changes 
include substituting any of isoleucine (I), valine (V), and leucine (L) for any other of these hydrophobic amino 
acids; aspartic acid (D) for glutamic acid (E) and vice versa; glutamine (Q) for asparagine (N) and vice versa; and 
serine (S) for threonine (T) and vice versa. Other substitutions can also be considered conservative, depending on 
the environment of the particular amino acid and its role in the three-dimensional structure of the protein. For 
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example, glycine (G) and alanine (A) can frequently be interchangeable, as can alanine (A) and valine (V). 
Methionine (M), which is relatively hydrophobic, can frequently be interchanged with leucine and isoleucine, and 
sometimes with valine. Lysine (K) and arginine (R) are frequently interchangeable in locations in which the 
significant feature of the amino acid residue is its charge and the differing pK's of these two amino acid residues are 
not significant. Still other changes can be considered "conservative" in particular environments (see, e.g. Table III 
herein; pages 13-15 "Biochemistry" 2 nd ED. Lubert Stryer ed (Stanford University); Henikoff et aL, PNAS 1992 
Vol 89 10915-10919; Lei et aL, J Biol Chem 1995 May 19; 270(20): 1 1 882-6). 

Embodiments of the invention disclosed herein include a wide variety of art-accepted variants or analogs 
of 121 PI Fl proteins such as polypeptides having amino acid insertions, deletions and substitutions. 121P1F1 
variants can be made using methods known in the art such as site-directed mutagenesis, alanine scanning, and PCR 
mutagenesis. Site-directed mutagenesis (Carter et aL, Nucl. Acids Res., 75:4331 (1986); Zoller et aL, Nucl. Acids 
Res., 70:6487 (1987)), cassette mutagenesis (Wells et aL, Gene, 34:315 (1985)), restriction selection mutagenesis 
(Wells et aL, Philos. Trans. R. Soc. London SerA, 3 17:415 (1986)) or other known techniques can be performed on 
the cloned DNA to produce the 121 PI Fl variant DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence that is involved in a specific biological activity such as a protein-protein interaction. Among 
the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acids include alanine, 
glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group because it 
eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of the 
variant. Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently 
found in both buried and exposed positions (Creighton, The Proteins, (W.H. Freeman & Co., N.Y.); Chothia, J. 
Mol. Biol, 150:1 (1976)). If alanine substitution does not yield adequate amounts of variant, an isosteric amino 
acid can be used. 

As defined herein, 12 1P1F1 variants, analogs or homologs, have the distinguishing attribute of having at 
least one epitope that is "cross reactive" with a 121P1F1 protein having an amino acid sequence of Figure 3. As 
used in this sentence, "cross reactive" means that an antibody or T cell that specifically binds to an 121 PI Fl variant 
also specifically binds to a 121 PI Fl protein having an amino acid sequence set forth in Figure 3. A polypeptide 
ceases to be a variant of a protein shown in Figure 3, when it no longer contains any epitope capable of being 
recognized by an antibody or T cell that specifically binds to the starting 121P1F1 protein. Those skilled in the art 
understand that antibodies that recognize proteins bind to epitopes of varying size, and a grouping of the order of 
about four or five amino acids, contiguous or not, is regarded as a typical number of amino acids in a minimal 
epitope. See, e.g., Nair et aL, J. Immunol 2000 165(12): 6949-6955; Hebbes et aL, Mol Immunol (1989) 
26(9):865-73; Schwartz et aL, J Immunol (1985) 135(4):2598-608. 

Other classes of 121 PI Fl -related protein variants share 70%, 75%, 80%, 85% or 90% or more similarity 
with an amino acid sequence of Figure 3,or a fragment thereof. Another specific class of 121P1F1 protein variants 
or analogs comprise one or more of the 121P1F1 biological motifs described herein or presently known in the art. 
Thus, encompassed by the present invention are analogs of 121 PI Fl fragments (nucleic or amino acid) that have 
altered functional (e.g. immunogenic) properties relative to the starting fragment. It is to be appreciated that motifs 
now or which become part of the art are to be applied to the nucleic or amino acid sequences of Figure 2 or Figure 
3. 

As discussed herein, embodiments of the claimed invention include polypeptides containing less than the 
full amino acid sequence of a 121 PI Fl protein shown in Figure 2 or Figure 3. For example, representative 

21 



511582003440 



embodiments of the invention comprise peptides/proteins having any 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more 
contiguous amino acids of a 121P1F1 protein shown in Figure 2 or Figure 3. 

Moreover, representative embodiments of the invention disclosed herein include polypeptides consisting of 
about amino acid 1 to about amino acid 10 of a 121 PI Fl protein shown in Figure 2 or Figure 3, polypeptides 
5 consisting of about amino acid 10 to about amino acid 20 of a 121 PI Fl protein shown in Figure 2 or Figure 3, 

polypeptides consisting of about amino acid 20 to about amino acid 30 of a 121 PI Fl protein shown in Figure 2 or 
Figure 3, polypeptides consisting of about amino acid 30 to about amino acid 40 of a 121P1F1 protein shown in 
Figure 2 or Figure 3, polypeptides consisting of about amino acid 40 to about amino acid 50 of a 121P1F1 protein 
shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 50 to about amino acid 60 of a 121P1F1 

10 protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 60 to about amino acid 70 of a 
121P1F1 protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 70 to about amino acid 
80 of a 121P1F1 protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 80 to about 
amino acid 90 of a 121P1F1 protein shown in Figure 2 or Figure 3, polypeptides consisting of about amino acid 90 
to about amino acid 100 of a 121P1F1 protein shown in Figure 2 or Figure 3, etc. throughout the entirety of a 

15 121P1F1 amino acid sequence. Moreover, polypeptides consisting of about amino acid 1 (or 20 or 30 or 40 etc.) to 
about amino acid 20, (or 130, or 140 or 150 etc.) of a 121 PI Fl protein shown in Figure 2 or Figure 3 are 
embodiments of the invention. It is to be appreciated that the starting and stopping positions in this paragraph refer 
to the specified position as well as that position plus or minus 5 residues. 

121PlFl-related proteins are generated using standard peptide synthesis technology or using chemical 

20 cleavage methods well known in the art. Alternatively, recombinant methods can be used to generate nucleic acid 
molecules that encode a 1 2 1 P 1 F 1 -related protein. In one embodiment, nucleic acid molecules provide a means to 
generate defined fragments of a 121 PI Fl protein (or variants, homologs or analogs thereof). 
Ill.A.) Motif-bearing Protein Embodiments 

Additional illustrative embodiments of the invention disclosed herein include 121P1F1 polypeptides 
25 comprising the amino acid residues of one or more of the biological motifs contained within a 121 PI Fl polypeptide 
sequence set forth in Figure 2 or Figure 3. Various motifs are known in the art, and a protein can be evaluated for 
the presence of such motifs by a number of publicly available Internet sites located on the World Wide Web (see, 
e.g., URL addresses: pfam.wustl.edu/; http://searchlauncher.bcm.tmc.edu/seq-search/struc-predict.html; 
psort.ims.u-tokyo.ac.jp/; www.cbs.dtu.dk/; www.ebi.ac.uk/interpro/scan.html; www.expasyxh/tools/scnpsitl .html; 
30 Epimatrix™ and Epimer™, Brown University, w*vw.brown.edu/Research/TB-HIV_^ and 
BIMAS, bimas.dcrt.nih.gov/.). 

Motif bearing subsequences of all 121P1F1 variant proteins are set forth and identified in Table XIX. 
Table XX sets forth several frequently occurring motifs based on pfam searches (see URL address 
pfam.wustl.edu/). The columns of Table XX list (1) motif name abbreviation, (2) percent identity found amongst 
35 the different member of the motif family, (3) motif name or description and (4) most common function; location 
information is included if the motif is relevant for location. 

Polypeptides comprising one or more of the 121 PI Fl motifs discussed above are useful in elucidating the 
specific characteristics of a malignant phenotype in view of the observation that the 121 PI Fl motifs discussed 
above are associated with growth dysregulation and because 121P1F1 is overexpressed in certain cancers (See, e.g., 
40 Table I). Casein kinase II, cAMP and camp-dependent protein kinase, and Protein Kinase C, for example, are 
enzymes known to be associated with the development of the malignant phenotype (see e.g. Chen et al. 9 Lab 
Invest, 78(2): 165-174 (1998); Gaiddon et aL, Endocrinology 136(10): 4331-4338 (1995); Hall et aL, Nucleic 
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Acids Research 24(6): 1 1 19-1 126 (1996); Peterziel et al, Oncogene 18(46): 6322-6329 (1999) and O'Brian, Oncol. 
Rep. 5(2): 305-309 (1998)). Moreover, both glycosylation and myristoylation are protein modifications also 
associated with cancer and cancer progression (see e.g. Dennis et al, Biochem. Biophys. Acta 1473(l):21-34 
(1999); Raju et al, Exp. Cell Res. 235(1): 145-154 (1997)). Amidation is another protein modification also 
5 associated with cancer and cancer progression (see e.g. Treston et al, J. Natl. Cancer Inst. Monogr. (13): 169-175 
(1992)). 

In another embodiment, proteins of the invention comprise one or more of the immunoreactive epitopes 
identified in accordance with art-accepted methods, such as the peptides set forth in Tables V-XVIII, XXVI, and 
XXVII. CTL epitopes can be determined using specific algorithms to identify peptides within an 121P1F1 protein that 

10 are capable of optimally binding to specified HLA alleles (e.g., Table IV; Epimatrix™ and Epimer™, Brown University, 
URL located on the World Wide Web at wAVw.brown.edu/Research/TB-HIVJ^b/epim^ and 
BIMAS, URL bimas.dcrt.nih.gov/.) Moreover, processes for identifying peptides that have sufficient binding affinity 
for HLA molecules and which are correlated with being immunogenic epitopes, are well known in the art, and are 
carried out without undue experimentation. In addition, processes for identifying peptides that are immunogenic 

15 epitopes, are well known in the art, and are carried out without undue experimentation either in vitro or in vivo. 
Also known in the art are principles for creating analogs of such epitopes in order to modulate 
immunogenicity. For example, one begins with an epitope that bears a CTL or HTL motif (see, e.g., the HLA Class 
I and HLA Class II motifs/supermotifs of Table IV). The epitope is analoged by substituting out an amino acid at 
one of the specified positions, and replacing it with another amino acid specified for that position. For example, 

20 one can substitute out a deleterious residue in favor of any other residue, such as a preferred residue as defined in 
Table IV; substitute a less-preferred residue with a preferred residue as defined in Table IV; or substitute an 
originally-occurring preferred residue with another preferred residue as defined in Table IV. Substitutions can 
occur at primary anchor positions or at other positions in a peptide; see, e.g., Table IV. 

A variety of references reflect the art regarding the identification and generation of epitopes in a protein of 

25 interest as well as analogs thereof. See, for example, WO 9733602 to Chesnut et al.; Sette, Immunogenetics 1999 
50(3-4): 201-212; Sette et al., J. Immunol. 2001 166(2): 1389-1397; Sidney et al, Hum. Immunol. 1997 58(1): 12- 
20; Kondo et al., Immunogenetics 1997 45(4): 249-258; Sidney et al., J. Immunol. 1996 157(8): 3480-90; and Falk 
et al., Nature 351: 290-6 (1991); Hunt et al., Science 255:1261-3 (1992); Parker et al., J. Immunol. 149:3580-7 
(1992); Parker et al., J. Immunol. 152:163-75 (1994)); Kast et al., 1994 152(8): 3904-12; Borras-Cuesta et al., 

30 Hum. Immunol. 2000 61(3): 266-278; Alexander et al, J. Immunol. 2000 164(3); 164(3): 1625-1633; Alexander et 
al, PMID: 7895164, Ul: 95202582; O'Sullivan et al, J. Immunol. 1991 147(8): 2663-2669; Alexander et al, 
Immunity 1994 1(9): 751-761 and Alexander et al, Immunol. Res. 1998 18(2): 79-92. 

Related embodiments of the inventions include polypeptides comprising combinations of the different 
motifs set forth in Table XIX, and/or, one or more of the predicted CTL epitopes of Table V through Table XVIII, 

35 and/or, one or more of the T cell binding motifs known in the art. Preferred embodiments contain no insertions, 
deletions or substitutions either within the motifs or the intervening sequences of the polypeptides. In addition, 
embodiments which include a number of either N-terminal and/or C-terminai amino acid residues on either side of 
these motifs may be desirable (to, for example, include a greater portion of the polypeptide architecture in which the 
motif is located). Typically the number of N-terminal and/or C-terminal amino acid residues on either side of a 

40 motif is between about 1 to about 100 amino acid residues, preferably 5 to about 50 amino acid residues. 

121PlFl-related proteins are embodied in many forms, preferably in isolated form. A purified 121P1F1 
protein molecule will be substantially free of other proteins or molecules that impair the binding of 121 PI Fl to 
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antibody, T cell or other ligand. The nature and degree of isolation and purification will depend on the intended use. 
Embodiments of a 121PlFl-related proteins include purified 121PlFl-related proteins and functional, soluble 
121 PI Fl -related proteins. In one embodiment, a functional, soluble 121 PI Fl protein or fragment thereof retains 
the ability to be bound by antibody, T cell or other ligand. 
5 The invention also provides 121P1F1 proteins comprising biologically active fragments of a 121P1F1 

amino acid sequence shown in Figure 2 or Figure 3. Such proteins exhibit properties of the starting 121P1F1 
protein, such as the ability to elicit the generation of antibodies that specifically bind an epitope associated with the 
starting 121P1F1 protein; to be bound by such antibodies; to elicit the activation of HTL or CTL; and/or, to be 
recognized by HTL or CTL that also specifically bind to the starting protein. 

10 1 2 1 P 1 F 1 -related polypeptides that contain particularly interesting structures can be predicted and/or identified 

using various analytical techniques well known in the art, including, for example, the methods of Chou-Fasman, 
Garnier-Robson, Kyte-Doolittle, Eisenberg, Karplus-Schultz or Jameson- Wolf analysis, or on the basis of 
immunogenicity. Fragments that contain such structures are particularly useful in generating subunit-specific anti- 
121P1F1 antibodies, or T cells or in identifying cellular factors that bind to 121P1F1. For example, hydrophi lie ity 

15 profiles can be generated, and immunogenic peptide fragments identified, using the method of Hopp, T.P. and 
Woods, K.R., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:3824-3828. Hydropathic ity profiles can be generated, and 
immunogenic peptide fragments identified, using the method of Kyte, J. and Doolittle, R.F., 1982, J. Mol. Biol. 
157:105-132. Percent (%) Accessible Residues profiles can be generated, and immunogenic peptide fragments 
identified, using the method of Janin J., 1979, Nature 277:491-492. Average Flexibility profiles can be generated, 

20 and immunogenic peptide fragments identified, using the method of Bhaskaran R., Ponnuswamy P.K., 1988, Int. J. 
Pept. Protein Res. 32:242-255. Beta-turn profiles can be generated, and immunogenic peptide fragments identified, 
using the method of Deleage, G., Roux B., 1987, Protein Engineering 1:289-294. 

CTL epitopes can be determined using specific algorithms to identify peptides within an 121P1F1 protein that 
are capable of optimally binding to specified HLA alleles (e.g., by using the SYFPEITH1 site at World Wide Web URL 

25 syfpeithi.bmi-heidelberg.com/; the listings in Table IV(A)-(E); Epimatrix™ and Epimer™, Brown University, URL 
located on the World Wide Web at (www.brown.edu/ResearchArB-HIV Lab/epimatrix/epimatrix.html); and BIMAS, 
URL bimas.dcrt.nih.gov/). Illustrating this, peptide epitopes from 121P1F1 that are presented in the context of 
human MHC class I molecules HLA-A1, A2, A3, Al 1, A24, B7 and B35 were predicted (Tables V-XVIII, XXVI, 
and XXVII). Specifically, the complete amino acid sequence of the 121P1F1 protein and relevant portions of other 

30 variants, i.e., for HLA Class I predictions 9 flanking redisues on either side of a point mutation, and for HLA Class 
II predictions 14 flanking residues on either side of a point mutation, were entered into the HLA Peptide Motif 
Search algorithm found in the Bioinformatics and Molecular Analysis Section (BIMAS) web site listed above; for 
HLA Class II the site SYFPEITHI at URL syfpeithi.bmi-heidelberg.com/ was used. 

The HLA peptide motif search algorithm was developed by Dr. Ken Parker based on binding of specific 

35 peptide sequences in the groove of HLA Class I molecules, in particular HLA-A2 (see, e.g., Falk et al y Nature 35 1 : 
290-6 (1991); Hunt et al. 9 Science 255:1261-3 (1992); Parker et aL, J. Immunol. 149:3580-7 (1992); Parker et aL, J. 
Immunol. 152:163-75 (1994)). This algorithm allows location and ranking of 8-mer, 9-mer, and 10-mer peptides 
from a complete protein sequence for predicted binding to HLA-A2 as well as numerous other HLA Class I 
molecules. Many HLA class I binding peptides are 8-, 9-, 10 or 1 1-mers. For example, for class I HLA-A2, the 

40 epitopes preferably contain a leucine (L) or methionine (M) at position 2 and a valine (V) or leucine (L) at the C- 
terminus (see, e.g., Parker et aL, J. Immunol. 149:3580-7 (1992)). Selected results of 121P1F1 predicted binding 
peptides are shown in Tables V-XVIII, XXVI, and XXVII herein. In Tables V-XVIII, the top 50 ranking 
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candidates, 9-mers and 10-mers, for each family member are shown along with their location, the amino acid 
sequence of each specific peptide, and an estimated binding score. The binding score corresponds to the estimated 
half time of dissociation of complexes containing the peptide at 37°C at pH 6.5. Peptides with the highest binding 
score are predicted to be the most tightly bound to HLA Class I on the cell surface for the greatest period of time 
5 and thus represent the best immunogenic targets for T-cell recognition. 

Actual binding of peptides to an HLA allele can be evaluated by stabilization of HLA expression on the 
antigen-processing defective cell line T2 (see, e.g., Xue et ai y Prostate 30:73-8 (1997) and Peshwa et ai, Prostate 
36: 129-38 (1998)). Immunogenicity of specific peptides can be evaluated in vitro by stimulation of CD8+ cytotoxic 
T lymphocytes (CTL) in the presence of antigen presenting cells such as dendritic cells. 

10 It is to be appreciated that every epitope predicted by the BIMAS site, Epimer™ and Epimatrix™ sites, or 

specified by the HLA class I or class II motifs available in the art or which become part of the art such as set forth 
in Table IV (or determined using World Wide Web site URL syfpeithi.bmi-heidelberg.com/, or BIMAS, 
bimas.dcrt.nih.gov/) are to be "applied" to a 121P1F1 protein in accordance with the invention. As used in this 
context "applied" means that a 121 PI Fl protein is evaluated, e.g., visually or by computer-based patterns finding 

1 5 methods, as appreciated by those of skill in the relevant art. Every subsequence of a 1 2 1 P 1 F 1 protein of 8, 9, 1 0, or 
1 1 amino acid residues that bears an HLA Class I motif, or a subsequence of 9 or more amino acid residues that 
bear an HLA Class II motif are within the scope of the invention. 
III.B.l Expression of 121PlFl-related Proteins 

In an embodiment described in the examples that follow, 12 1 PI Fl can be conveniently expressed in cells 

20 (such as 293T cells) transfected with a commercially available expression vector such as a CMV-driven expression 
vector encoding 121P1F1 with a C-terminal 6XHis and MYC tag (pcDNA3.1/mycHIS, Invitrogen or Tag5, 
GenHunter Corporation, Nashville TN). The Tag5 vector provides an IgGK secretion signal that can be used to 
facilitate the production of a secreted 121 PI Fl protein in transfected cells. The secreted HIS-tagged 121P1F1 in 
the culture media can be purified, e.g., using a nickel column using standard techniques. 

25 11LC.) Modifications of 121PlFl-related Proteins 

Modifications of !21PlFl-related proteins such as covalent modifications are included within the scope of 
this invention. One type of covalent modification includes reacting targeted amino acid residues of a 121 PI Fl 
polypeptide with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C- 
terminal residues of a 121 PI Fl protein. Another type of covalent modification of a 121 PI Fl polypeptide included 

30 within the scope of this invention comprises altering the native glycosylation pattern of a protein of the invention. 
Another type of covalent modification of 121 PI Fl comprises linking a 121 PI Fl polypeptide to one of a variety of 
nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in the 
manner set forth in U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

The 121 PI Fl -related proteins of the present invention can also be modified to form a chimeric molecule 

35 comprising 121P1F1 fused to another, heterologous polypeptide or amino acid sequence. Such a chimeric molecule 
can be synthesized chemically or recombinantly. A chimeric molecule can have a protein of the invention fused to 
another tumor-associated antigen or fragment thereof. Alternatively, a protein in accordance with the invention can 
comprise a fusion of fragments of a 121 PI Fl sequence (amino or nucleic acid) such that a molecule is created that 
is not, through its length, directly homologous to the amino or nucleic acid sequences shown in Figure 2 or Figure 

40 3. Such a chimeric molecule can comprise multiples of the same subsequence of 121P1F1. A chimeric molecule 
can comprise a fusion of a 121 PI Fl -related protein with a polyhistidine epitope tag, which provides an epitope to 
which immobilized nickel can selectively bind, with cytokines or with growth factors. The epitope tag is generally 



511582003440 



placed at the amino- or carboxyl- terminus of al21PlFl protein. In an alternative embodiment, the chimeric 
molecule can comprise a fusion of a 121PlFl-related protein with an immunoglobulin or a particular region of an 
immunoglobulin. For a bivalent form of the chimeric molecule (also referred to as an "immunoadhesin"), such a 
fusion could be to the Fc region of an IgG molecule. The Ig fusions preferably include the substitution of a soluble 
5 (transmembrane domain deleted or inactivated) form of a 121P1F1 polypeptide in place of at least one variable 

region within an Ig molecule. In a preferred embodiment, the immunoglobulin fusion includes the hinge, CH2 and 
CH3, or the hinge, CHI, CH2 and CH3 regions of an IgGI molecule. For the production of immunoglobulin fusions 
see, e.g., U.S. Patent No. 5,428,130 issued June 27, 1995. 
IILD.) Uses of 121PlFl-related Proteins 

1 0 The proteins of the invention have a number of different specific uses. As 1 2 1 P 1 F 1 is highly expressed in 

prostate and other cancers, 121 PI Fl -related proteins are used in methods that assess the status of 121P1F1 gene 
products in normal versus cancerous tissues, thereby elucidating the malignant phenotype. Typically, polypeptides 
from specific regions of a 121P1F1 protein are used to assess the presence of perturbations (such as deletions, 
insertions, point mutations etc.) in those regions (such as regions containing one or more motifs). Exemplary assays 

15 utilize antibodies or T cells targeting 121 PI Fl -related proteins comprising the amino acid residues of one or more 
of the biological motifs contained within a 121 PI Fl polypeptide sequence in order to evaluate the characteristics of 
this region in normal versus cancerous tissues or to elicit an immune response to the epitope. Alternatively, 
1 2 1P1F1 -related proteins that contain the amino acid residues of one or more of the biological motifs in a 121 PI Fl 
protein are used to screen for factors that interact with that region of 121 PI Fl. 

20 121P1F1 protein fragments/subsequences are particularly useful in generating and characterizing domain- 

specific antibodies (e.g., antibodies recognizing an extracellular or intracellular epitope of an 121 PI Fl protein), for 
identifying agents or cellular factors that bind to 1 2 1 P 1 F 1 or a particular structural domain thereof, and in various 
therapeutic and diagnostic contexts, including but not limited to diagnostic assays, cancer vaccines and methods of 
preparing such vaccines. 

25 Proteins encoded by the 121P1F1 genes, or by analogs, homologs or fragments thereof, have a variety of 

uses, including but not limited to generating antibodies and in methods for identifying ligands and other agents and 
cellular constituents that bind to an 121P1F1 gene product. Antibodies raised against an 121P1F1 protein or fragment 
thereof are useful in diagnostic and prognostic assays, and imaging methodologies in the management of human 
cancers characterized by expression of 121 PI Fl protein, such as those listed in Table I. Such antibodies can be 

30 expressed intracellular^ and used in methods of treating patients with such cancers. 1 2 1 P 1 F 1 -related nucleic acids 
or proteins are also used in generating HTL or CTL responses. 

Various immunological assays useful for the detection of 121P1F1 proteins are used, including but not limited 
to various types of radioimmunoassays, enzyme-linked immunosorbent assays (ELISA), enzyme-linked 
immunofluorescent assays (ELIFA), immunocytochemical methods, and the like. Antibodies can be labeled and used as 

3 5 immunological imaging reagents capable of detecting 1 2 1 P 1 F 1 -expressing cells (e.g., in radioscintigraphic imaging 
methods). 121 PI Fl proteins are also particularly useful in generating cancer vaccines, as further described herein. 
IV/> 121P1F1 Antibodies 

Another aspect of the invention provides antibodies that bind to 1 2 1 P 1 F 1 -related proteins. Preferred 
antibodies specifically bind to a 12 1 PI Fl -related protein and do not bind (or bind weakly) to peptides or proteins that 
40 are not 1 2 1 P 1 F 1 -related proteins. For example, antibodies that bind 121P1F1 can bind 121P1F1 -related proteins such 
as the homologs or analogs thereof. 
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121P1F1 antibodies of the invention are particularly useful in cancer (see, e.g., Table 1) diagnostic and 
prognostic assays, and imaging methodologies. Similarly, such antibodies are useful in the treatment, diagnosis, 
and/or prognosis of other cancers, to the extent 12 1P1 Fl is also expressed or overexpressed in these other cancers. 
Moreover, intracellularly expressed antibodies (e.g., single chain antibodies) are therapeutically useful in treating 
5 cancers in which the expression of 12 1 PI Fl is involved, such as advanced or metastatic prostate cancers. 

The invention also provides various immunological assays useful for the detection and quantification of 
1 2 1 P 1 F 1 and mutant 1 2 1 P 1 F 1 -related proteins. Such assays can comprise one or more 1 2 1 P 1 F 1 antibodies capable of 
recognizing and binding a 1 2 1 P 1 F 1 -related protein, as appropriate. These assays are performed within various 
immunological assay formats well known in the art, including but not limited to various types of radioimmunoassays, 
10 enzyme-linked immunosorbent assays (ELISA), enzyme-linked immunofluorescent assays (ELIFA), and the like. 

Immunological non-antibody assays of the invention also comprise T cell immunogenic ity assays (inhibitory 
or stimulatory) as well as major histocompatibility complex (MHC) binding assays. 

In addition, immunological imaging methods capable of detecting prostate cancer and other cancers expressing 
121P1F1 are also provided by the invention, including but not limited to radioscintigraphic imaging methods using 
15 labeled 121P1F1 antibodies. Such assays are clinically useful in the detection, monitoring, and prognosis of 121P1F1 
expressing cancers such as prostate cancer. 

1 2 1 P 1 F 1 antibodies are also used in methods for purifying a 1 2 1 P 1 F 1 -related protein and for isolating 
1 2 1 P 1 F 1 homologues and related molecules. For example, a method of purifying a 1 2 1 P 1 F 1 -related protein comprises 
incubating an 12 1 PI Fl antibody, which has been coupled to a solid matrix, with a lysate or other solution containing a 
20 121 PI Fl -related protein under conditions that permit the 121P1F1 antibody to bind to the 12 1 PI Fl -related protein; 
washing the solid matrix to eliminate impurities; and eluting the 1 2 1 P 1 F 1 -related protein from the coupled antibody. 
Other uses of 121 PI Fl antibodies in accordance with the invention include generating anti-idiotypic antibodies that 
mimic a 121P1F1 protein. 

Various methods for the preparation of antibodies are well known in the art. For example, antibodies can be 

25 prepared by immunizing a suitable mammalian host using a 1 2 1 P 1 F 1 -related protein, peptide, or fragment, in isolated or 
immunoconjugated form (Antibodies: A Laboratory Manual, CSH Press, Eds., Harlow, and Lane (1988); Harlow, 
Antibodies, Cold Spring Harbor Press, NY (1 989)). In addition, fusion proteins of 1 2 1 P 1 F 1 can also be used, such as a 
121P1 Fl GST-fusion protein. In a particular embodiment, a GST fusion protein comprising all or most of the amino 
acid sequence of Figure 2 or Figure 3 is produced, then used as an immunogen to generate appropriate antibodies. In 

30 another embodiment, a 121PlFl-related protein is synthesized and used as an immunogen. 

In addition, naked DNA immunization techniques known in the art are used (with or without purified 
12 IP 1F1 -related protein or 121 PI Fl expressing cells) to generate an immune response to the encoded immunogen (for 
review, see Donnelly et ai, 1997, Ann. Rev. Immunol. 15: 617-648). 

The amino acid sequence of a 121 PI Fl protein as shown in Figure 2 or Figure 3 can be analyzed to select 

35 specific regions of the 121P1F1 protein for generating antibodies. For example, hydrophobicity and hydrophilicity 

analyses of a 121P1F1 amino acid sequence are used to identify hydrophilic regions in the 121P1F1 structure. Regions 
of a 121 PI Fl protein that show immunogenic structure, as well as other regions and domains, can readily be identified 
using various other methods known in the art, such as Chou-Fasman, Garnier-Robson, Kyte-Doolittle, Eisenberg, 
Karplus-Schultz or Jameson- Wolf analysis. Hydrophilicity profiles can be generated using the method of Hopp, T.P. 

40 and Woods, K.R., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:3824-3828. Hydropathicity profiles can be generated 

using the method of Kyte, J. and Doolittle, R.F., 1982, J. Mol. Biol. 157:105-132. Percent (%) Accessible Residues 
profiles can be generated using the method of Janin J., 1979, Nature 277:491-492. Average Flexibility profiles can 
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be generated using the method of Bhaskaran R., Ponnuswamy P.K., 1988, Int. J. Pept. Protein Res. 32:242-255. 
Beta-turn profiles can be generated using the method of Deleage, G., Roux B., 1987, Protein Engineering 1 :289- 
294. Thus, each region identified by any of these programs or methods is within the scope of the present invention. 
Methods for the generation of 1 2 1 P 1 F 1 antibodies are further illustrated by way of the examples provided herein. 
5 Methods for preparing a protein or polypeptide for use as an immunogen are well known in the art. Also well known in 
the art are methods for preparing immunogenic conjugates of a protein with a carrier, such as BSA, KLH or other carrier 
protein. In some circumstances, direct conjugation using, for example, carbodiimide reagents are used; in other 
instances linking reagents such as those supplied by Pierce Chemical Co., Rockford, IL, are effective. Administration of 
a 121P1F1 immunogen is often conducted by injection over a suitable time period and with use of a suitable adjuvant, as 
10 is understood in the art. During the immunization schedule, titers of antibodies can be taken to determine adequacy of 
antibody formation. 

121P1F1 monoclonal antibodies can be produced by various means well known in the art. For example, 
immortalized cell lines that secrete a desired monoclonal antibody are prepared using the standard hybridoma 
technology of Kohler and Milstein or modifications that immortalize antibody-producing B cells, as is generally known. 

1 5 Immortalized cell lines that secrete the desired antibodies are screened by immunoassay in which the antigen is a 

121PlFl-related protein. When the appropriate immortalized cell culture is identified, the cells can be expanded and 
antibodies produced either from in vitro cultures or from ascites fluid. 

The antibodies or fragments of the invention can also be produced, by recombinant means. Regions that bind 
specifically to the desired regions of a 121 PI Fl protein can also be produced in the context of chimeric or 

20 complementarity determining region (CDR) grafted antibodies of multiple species origin. Humanized or human 

121P1F1 antibodies can also be produced, and are preferred for use in therapeutic contexts. Methods for humanizing 
murine and other non-human antibodies, by substituting one or more of the non-human antibody CDRs for 
corresponding human antibody sequences, are well known (see for example, Jones et al., 1986, Nature 321 : 522-525; 
Riechmann et al., 1988, Nature 332: 323-327; Verhoeyen et al., 1988, Science 239: 1534-1536). See also, Carter et al., 

25 1993, Proc. Natl. Acad. Sci. USA 89: 4285 and Sims et al., 1993, J. Immunol. 151: 2296. 

Methods for producing fully human monoclonal antibodies include phage display and transgenic methods (for 
review, see Vaughane/ al, 1998, Nature Biotechnology 16:535-539). Fully human 121P1F1 monoclonal antibodies 
can be generated using cloning technologies employing large human Ig gene combinatorial libraries (i.e., phage display) 
(Griffiths and Hoogenboom, Building an in vitro immune system: human antibodies from phage display libraries. In: 

30 Protein Engineering of Antibody Molecules for Prophylactic and Therapeutic Applications in Man, Clark, M. (Ed.), 
Nottingham Academic, pp 45-64 (1993); Burton and Barbas, Human Antibodies from combinatorial libraries. Id., pp 
65-82). Fully human 1 2 1 P 1 F 1 monoclonal antibodies can also be produced using transgenic mice engineered to contain 
human immunoglobulin gene loci as described in PCT Patent Application W098/24893, Kucherlapati and Jakobovits et 
aL, published December 3, 1997 (see also, Jakobovits, 1998, Exp. Opin. Invest. Drugs 7(4): 607-614; U.S. patents 

35 6,162,963 issued 19 December 2000; 6,150,584 issued 12 November 2000; and, 6,1 14598 issued 5 September 2000). 
This method avoids the in vitro manipulation required with phage display technology and efficiently produces high 
affinity authentic human antibodies. 

Reactivity of 12 1 PI F 1 antibodies with an 1 2 1 P 1 F 1 -related protein can be established by a number of well 
known means, including Western blot, immunoprecipitation, ELISA, and FACS analyses using, as appropriate, 

40 1 2 1 P 1 F 1 -related proteins, 1 2 1 P 1 F 1 -expressing cells or extracts thereof. A 1 2 1 P 1 F 1 antibody or fragment thereof 
can be labeled with a detectable marker or conjugated to a second molecule. Suitable detectable markers include, 
but are not limited to, a radioisotope, a fluorescent compound, a bioluminescent compound, chemiluminescent 
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compound, a metal chelator or an enzyme. Further, bi-specific antibodies specific for two or more 1 2 1 P 1 F 1 
epitopes are generated using methods generally known in the art. Homodimeric antibodies can also be generated by 
cross-linking techniques known in the art (e.g., Wolff et aL, Cancer Res. 53: 2560-2565). 

5 V.) 121P1F1 Cellular Immune Responses 

The mechanism by which T cells recognize antigens has been delineated. Efficacious peptide epitope 
vaccine compositions of the invention induce a therapeutic or prophylactic immune responses in very broad 
segments of the world-wide population. For an understanding of the value and efficacy of compositions of the 
invention that induce cellular immune responses, a brief review of immunology-related technology is provided. 

10 A complex of an HLA molecule and a peptidic antigen acts as the ligand recognized by HLA-restricted T 

cells (Buus, S. et aL, Cell 47: 1071, 1986; Babbitt, B. P. et aL, Nature 317:359, 1985; Townsend, A. and Bodmer, 
H., Annu. Rev. Immunol, 7:601, 1989; Germain, R. N., Annu. Rev. Immunol. 1 1:403, 1993). Through the study of 
single amino acid substituted antigen analogs and the sequencing of endogenously bound, naturally processed 
peptides, critical residues that correspond to motifs required for specific binding to HLA antigen molecules have 

15 been identified and are set forth in Table IV (see also, e.g., Southwood, et aL, J. Immunol. 160:3363, 1998; 

Rammensee, et aL, Immunogenetics 41 : 178, 1995; Rammensee et aL, SYFPEITH1, access via World Wide Web at 
URL syfpeithi.bmi-heidelberg.com/; Sette, A. and Sidney, J. Curr. Opin. Immunol. 10:478, 1998; Engelhard, V. H., 
Curr. Opin. Immunol. 6:13, 1994; Sette, A. and Grey, H. M., Curr. Opin. Immunol. 4:79, 1992; Sinigaglia, F. and 
Hammer, J. Curr. Biol. 6:52, 1994; Ruppert et aL, Cell 74:929-937, 1993; Kondo et aL, J. Immunol. 155:4307- 

20 4312, 1995; Sidney et aL, J. Immunol. 157:3480-3490, 1996; Sidney et aL, Human Immunol. 45:79-93, 1996; Sette, 
A. and Sidney, J. Immunogenetics 1999 Nov; 50(3-4):20l-l2 t Review). 

Furthermore, x-ray crystal lographic analyses of HLA-peptide complexes have revealed pockets within the 
peptide binding cleft/groove of HLA molecules which accommodate, in an allele-specific mode, residues borne by 
peptide ligands; these residues in turn determine the HLA binding capacity of the peptides in which they are 

25 present. (See, e.g., Madden, D.R. Annu. Rev. Immunol. 13:587, 1995; Smith, et aL, Immunity 4:203, 1996; Fremont 
et aL, Immunity 8:305, 1998; Stern et aL, Structure 2:245, 1994; Jones, E.Y. Curr. Opin. Immunol. 9:75, 1997; 
Brown, J. H. et aL, Nature 364:33, 1993; Guo, H. C. et aL, Proc. Natl. Acad Sci. USA 90:8053, 1993; Guo, H. C. et 
aL, Nature 360:364, 1992; Silver, M. L. et aL, Nature 360:367, 1992; Matsumura, M. et aL, Science 257:927, 1992; 
Madden et aL, Cell 70: 1035, 1992; Fremont, D. H. et aL, Science 257:919, 1992; Saper, M. A. , Bjorkman, P. J. and 

30 Wiley, D. C, J. Mol. Biol. 2 1 9:277, 1 99 1 .) 

Accordingly, the definition of class I and class II allele-specific HLA binding motifs, or class I or class II 
supermotifs allows identification of regions within a protein that are correlated with binding to particular HLA 
antigen(s). 

Thus, by a process of HLA motif identification, candidates for epitope-based vaccines have been 
35 identified; such candidates can be further evaluated by HLA-peptide binding assays to determine binding affinity 
and/or the time period of association of the epitope and its corresponding HLA molecule. Additional confirmatory 
work can be performed to select, amongst these vaccine candidates, epitopes with preferred characteristics in terms 
of population coverage, and/or immunogenicity. 

Various strategies can be utilized to evaluate cellular immunogenicity, including: 
40 1) Evaluation of primary T cell cultures from normal individuals (see, e.g., Wentworth, P. A. et aL, Mol. 

Immunol. 32:603, 1995; Celis, E. et aL, Proc. Natl. Acad. Sci. USA 91:2105, 1994; Tsai, V. et aL, J. Immunol. 
158:1796, 1997; Kawashima, I. et aL, Human Immunol. 59:1, 1998). This procedure involves the stimulation of 
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peripheral blood lymphocytes (PBL) from normal subjects with a test peptide in the presence of antigen presenting 
cells in vitro over a period of several weeks. T cells specific for the peptide become activated during this time and 
are detected using, e.g., a lymphokine- or 5l Cr-release assay involving peptide sensitized target cells. 

2) Immunization of HLA transgenic mice (see, e.g., Wentworth, P. A. et ai, J. Immunol. 26:97, 1996; 
5 Wentworth, P. A. et ai, Int. Immunol. 8:651, 1996; Alexander, J. et al.,J. Immunol. 159:4753, 1997). For example, 
in such methods peptides in incomplete Freund's adjuvant are administered subcutaneously to HLA transgenic 
mice. Several weeks following immunization, splenocytes are removed and cultured in vitro in the presence of test 
peptide for approximately one week. Peptide-speciflc T cells are detected using, e.g., a 5 ^r-release assay 
involving peptide sensitized target cells and target cells expressing endogenously generated antigen. 

10 3) Demonstration of recall T cell responses from immune individuals who have been either effectively 

vaccinated and/or from chronically ill patients (see, e.g. , Rehermann, B. et al. , J. Exp. Med. 181:1 047, 1 995 ; 
Doolan, D. L. et ai, Immunity 1:97, 1997; Bertoni, R. et ai, J. Clin. Invest. 100:503, 1997; Threlkeld, S. C. et al., J. 
Immunol. 159:1648, 1997; Diepolder, H. M. et al., J. Virol. 71:601 1, 1997). Accordingly, recall responses are 
detected by culturing PBL from subjects that have been exposed to the antigen due to disease and thus have 

15 generated an immune response "naturally", or from patients who were vaccinated against the antigen. PBL from 
subjects are cultured in vitro for 1-2 weeks in the presence of test peptide plus antigen presenting cells (APC) to 
allow activation of "memory" T cells, as compared to "naive" T cells. At the end of the culture period, T cell 
activity is detected using assays including 5 *Cr release involving peptide-sensitized targets, T cell proliferation, or 
lymphokine release. 

20 

VI.) 121P1F1 Transgenic Animals 

Nucleic acids that encode a 121PlFl-related protein can also be used to generate either transgenic animals 
or "knock out" animals that, in turn, are useful in the development and screening of therapeutically useful reagents. 
In accordance with established techniques, cDNA encoding 121 P1F1 can be used to clone genomic DNA that 

25 encodes 121P1F1. The cloned genomic sequences can then be used to generate transgenic animals containing cells 
that express DNA that encode 12 1 PI F 1. Methods for generating transgenic animals, particularly animals such as 
mice or rats, have become conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 
issued 12 April 1988, and 4,870,009 issued 26 September 1989. Typically, particular cells would be targeted for 
121P1F1 transgene incorporation with tissue-specific enhancers. 

30 Transgenic animals that include a copy of a transgene encoding 1 2 1 P 1 F 1 can be used to examine the effect 

of increased expression of DNA that encodes 1 2 1 P 1 F 1 . Such animals can be used as tester animals for reagents 
thought to confer protection from, for example, pathological conditions associated with its overexpression. In 
accordance with this aspect of the invention, an animal is treated with a reagent and a reduced incidence of a 
pathological condition, compared to untreated animals that bear the transgene, would indicate a potential 

35 therapeutic intervention for the pathological condition. 

Alternatively, non-human homologues of 121 PI Fl can be used to construct a 121 PI Fl "knock out" animal 
that has a defective or altered gene encoding 121 PI Fl as a result of homologous recombination between the 
endogenous gene encoding 121P1F1 and altered genomic DNA encoding 121P1F1 introduced into an embryonic 
cell of the animal. For example, cDNA that encodes 12 1 PI Fl can be used to clone genomic DNA encoding 

40 121P1F1 in accordance with established techniques. A portion of the genomic DNA encoding 121 PI Fl can be 
deleted or replaced with another gene, such as a gene encoding a selectable marker that can be used to monitor 
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integration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included in the 
vector (see, e.g., Thomas and Capecchi, Cell , 51*503 (1987) for a description of homologous recombination 
vectors). The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced DNA has homologously recombined with the endogenous DNA are selected (see, e.g., Li et al., Cell, 
5 69:915 (1992)). The selected cells are then injected into a blastocyst of an animal (e.g., a mouse or rat) to form 

aggregation chimeras (see, e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E. 
J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-152). A chimeric embryo can then be implanted into a suitable 
pseudopregnant female foster animal, and the embryo brought to term to create a "knock out" animal. Progeny 
harboring the homologously recombined DNA in their germ cells can be identified by standard techniques and used 
10 to breed animals in which all cells of the animal contain the homologously recombined DNA. Knock out animals 
can be characterized, for example, for their ability to defend against certain pathological conditions or for their 
development of pathological conditions due to absence of a 121P1F1 polypeptide. 

VII.) Methods for the Detection of 121P1F1 

1 5 Another aspect of the present invention relates to methods for detecting 1 2 1 P 1 F 1 polynucleotides and 

121 PI Fl -related proteins, as well as methods for identifying a cell that expresses 121P1F1. The expression profile of 
121P1F1 makes it a diagnostic marker for metastasized disease. Accordingly, the status of 121 PI Fl gene products 
provides information useful for predicting a variety of factors including susceptibility to advanced stage disease, rate of 
progression, and/or tumor aggressiveness. As discussed in detail herein, the status of 121P1F1 gene products in patient 

20 samples can be analyzed by a variety protocols that are well known in the art including immunohistochemical analysis, 
the variety of Northern blotting techniques including in situ hybridization, RT-PCR analysis (for example on laser 
capture micro-dissected samples), Western blot analysis and tissue array analysis. 

More particularly, the invention provides assays for the detection of 121P1F1 polynucleotides in a biological 
sample, such as serum, bone, prostate, and other tissues, urine, semen, cell preparations, and the like. Detectable 

25 121P1F1 polynucleotides include, for example, a 121P1F1 gene or fragment thereof, 121PIF1 mRNA, alternative splice 
variant 121P1F1 mRNAs, and recombinant DNA or RNA molecules that contain a 121 PI Fl polynucleotide. A number 
of methods for amplifying and/or detecting the presence of 1 2 1 P 1 F 1 polynucleotides are well known in the art and can 
be employed in the practice of this aspect of the invention. 

In one embodiment, a method for detecting an 1 2 1 P 1 F 1 mRNA in a biological sample comprises producing 

30 cDNA from the sample by reverse transcription using at least one primer; amplifying the cDNA so produced using 
an 121P1F1 polynucleotides as sense and antisense primers to amplify 121P1F1 cDNAs therein; and detecting the 
presence of the amplified 121P1F1 cDNA. Optionally, the sequence of the amplified 121P1F1 cDNA can be 
determined. 

In another embodiment, a method of detecting a 12 1 PI Fl gene in a biological sample comprises first 
35 isolating genomic DNA from the sample; amplifying the isolated genomic DNA using 121 P1F1 polynucleotides as 
sense and antisense primers; and detecting the presence of the amplified 121 PI Fl gene. Any number of appropriate 
sense and antisense probe combinations can be designed from a 121 PI Fl nucleotide sequence (see, e.g., Figure 2) 
and used for this purpose. 

The invention also provides assays for detecting the presence of an 1 2 1 P 1 F 1 protein in a tissue or other 
40 biological sample such as serum, semen, bone, prostate, urine, cell preparations, and the like. Methods for detecting a 
1 21 P1F1 -related protein are also well known and include, for example, immunoprecipitation, immunohistochemical 
analysis, Western blot analysis, molecular binding assays, ELISA, ELI FA and the like. For example, a method of 
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detecting the presence of a 121 PI Fl -related protein in a biological sample comprises first contacting the sample 
with a 121P1F1 antibody, a 121PlFl-reactive fragment thereof, or a recombinant protein containing an antigen 
binding region of a 121 PI Fl antibody; and then detecting the binding of 121PlFl-reIated protein in the sample. 
Methods for identifying a cell that expresses 12 1 PI Fl are also within the scope of the invention. In one 
5 embodiment, an assay for identifying a cell that expresses a 121P1F1 gene comprises detecting the presence of 121P1F1 
mRNA in the cell. Methods for the detection of particular mRNAs in cells are well known and include, for example, 
hybridization assays using complementary DNA probes (such as in situ hybridization using labeled 121P1F1 
riboprobes, Northern blot and related techniques) and various nucleic acid amplification assays (such as RT-PCR using 
complementary primers specific for 121P1F1, and other amplification type detection methods, such as, for example, 

10 branched DNA, SISBA, TMA and the like). Alternatively, an assay for identifying a cell that expresses a 121 PI Fl gene 
comprises detecting the presence of 1 2 1 P 1 F 1 -related protein in the cell or secreted by the cell. Various methods for the 
detection of proteins are well known in the art and are employed for the detection of 121PlFl-related proteins and cells 
that express 1 2 1 P 1 F 1 -related proteins. 

121P1F1 expression analysis is also useful as a tool for identifying and evaluating agents that modulate 

15 121P1F1 gene expression. For example, 121P1F1 expression is significantly upregulated in prostate cancer, and is 
expressed in cancers of the tissues listed in Table I. Identification of a molecule or biological agent that inhibits 
121P1F1 expression or over-expression in cancer cells is of therapeutic value. For example, such an agent can be 
identified by using a screen that quantifies 121P1F1 expression by RT-PCR, nucleic acid hybridization or antibody 
binding. 

20 

VIII.) Methods for Monitoring the Status of 121 PI Fl -related Genes and Their Products 

Oncogenesis is known to be a multistep process where cellular growth becomes progressively dysregulated 
and cells progress from a normal physiological state to precancerous and then cancerous states (see, e.g., Alers et 
aL, Lab Invest. 77(5): 437-438 (1997) and Isaacs et aL, Cancer Surv. 23: 19-32 (1995)). In this context, examining 

25 a biological sample for evidence of dysregulated cell growth (such as aberrant 121P1F1 expression in cancers) 

allows for early detection of such aberrant physiology, before a pathologic state such as cancer has progressed to a 
stage that therapeutic options are more limited and or the prognosis is worse. In such examinations, the status of 
1 2 1 P 1 F 1 in a biological sample of interest can be compared, for example, to the status of 1 2 1 P 1 F 1 in a 
corresponding normal sample (e.g. a sample from that individual or alternatively another individual that is not 

30 affected by a pathology). An alteration in the status of 1 2 1 P 1 F 1 in the biological sample (as compared to the 

normal sample) provides evidence of dysregulated cellular growth. In addition to using a biological sample that is 
not affected by a pathology as a normal sample, one can also use a predetermined normative value such as a 
predetermined normal level of mRNA expression (see, e.g., Grever et aL t J. Comp. Neurol. 1996 Dec 9; 376(2): 
306-14 and U.S. Patent No. 5,837,501) to compare 121P1F1 status in a sample. 

35 The term "status" in this context is used according to its art accepted meaning and refers to the condition or 

state of a gene and its products. Typically, skilled artisans use a number of parameters to evaluate the condition or state 
of a gene and its products. These include, but are not limited to the location of expressed gene products (including the 
location of 1 2 1 P 1 F 1 expressing cells) as well as the level, and biological activity of expressed gene products (such as 
121P1F1 mRNA, polynucleotides and polypeptides). Typically, an alteration in the status of 121 PI Fl comprises a 

40 change in the location of 121 PI Fl and/or 121P1F1 expressing cells and/or an increase in 121P1F1 mRNA and/or 
protein expression. 
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121 PI Fl status in a sample can be analyzed by a number of means well known in the art, including without 
limitation, immunohistochemical analysis, in situ hybridization, RT-PCR analysis on laser capture micro-dissected 
samples, Western blot analysis, and tissue array analysis. Typical protocols for evaluating the status of a 121P1F1 gene 
and gene products are found, for example in Ausubel et al eds., 1995, Current Protocols In Molecular Biology, Units 
5 2 (Northern Blotting), 4 (Southern Blotting), 15 (Immunoblotting) and 18 (PCR Analysis). Thus, the status of 
121P1F1 in a biological sample is evaluated by various methods utilized by skilled artisans including, but not 
limited to genomic Southern analysis (to examine, for example perturbations in a 12 1 PI Fl gene), Northern analysis 
and/or PCR analysis of 121 PI Fl mRNA (to examine, for example alterations in the polynucleotide sequences or 
expression levels of 121 PI Fl mRNAs), and, Western and/or immunohistochemical analysis (to examine, for 
10 example alterations in polypeptide sequences, alterations in polypeptide localization within a sample, alterations in 
expression levels of 121 PI Fl proteins and/or associations of 121 PI Fl proteins with polypeptide binding partners). 
Detectable 121P1F1 polynucleotides include, for example, a 121P1F1 gene or fragment thereof, 121P1F1 mRNA, 
alternative splice variants, 1 2 1 P 1 F 1 mRNAs, and recombinant DNA or RNA molecules containing a 1 2 1 PI F 1 
polynucleotide, 

1 5 The expression profile of 1 2 1 P 1 F 1 makes it a diagnostic marker for local and/or metastasized disease, and 

provides information on the growth or oncogenic potential of a biological sample. In particular, the status of 121P1F1 
provides information useful for predicting susceptibility to particular disease stages, progression, and/or tumor 
aggressiveness. The invention provides methods and assays for determining 1 2 1 P 1 F 1 status and diagnosing cancers that 
express 121P1F1, such as cancers of the tissues listed in Table I. For example, because 121P1F1 mRNA is so highly 

20 expressed in prostate and other cancers relative to normal prostate tissue, assays that evaluate the levels of 1 2 1 P 1 F 1 
mRNA transcripts or proteins in a biological sample can be used to diagnose a disease associated with 121P1F1 
dysregulation, and can provide prognostic information useful in defining appropriate therapeutic options. 

The expression status of 121 PI Fl provides information including the presence, stage and location of 
dysplastic, precancerous and cancerous cells, predicting susceptibility to various stages of disease, and/or for gauging 

25 tumor aggressiveness. Moreover, the expression profile makes it useful as an imaging reagent for metastasized 
disease. Consequently, an aspect of the invention is directed to the various molecular prognostic and diagnostic 
methods for examining the status of 1 2 1 P 1 F 1 in biological samples such as those from individuals suffering from, or 
suspected of suffering from a pathology characterized by dysregulated cellular growth, such as cancer. 

As described above, the status of 121P1F1 in a biological sample can be examined by a number of well- 

30 known procedures in the art. For example, the status of 121 PI Fl in a biological sample taken from a specific 

location in the body can be examined by evaluating the sample for the presence or absence of 121 PI Fl expressing 
cells (e.g. those that express 12 1 PI Fl mRNAs or proteins). This examination can provide evidence of dysregulated 
cellular growth, for example, when 12 1 PI Fl -expressing cells are found in a biological sample that does not 
normally contain such cells (such as a lymph node), because such alterations in the status of 121 PI Fl in a biological 

35 sample are often associated with dysregulated cellular growth. Specifically, one indicator of dysregulated cellular 
growth is the metastases of cancer cells from an organ of origin (such as the prostate) to a different area of the body 
(such as a lymph node). In this context, evidence of dysregulated cellular growth is important for example because 
occult lymph node metastases can be detected in a substantial proportion of patients with prostate cancer, and such 
metastases are associated with known predictors of disease progression (see, e.g., Murphy et a/., Prostate 42(4): 

40 315-317 (200O);Su et aL, Semin. Surg. Oncol. 18(1): 17-28 (2000) and Freeman et al., J Urol 1995 Aug 154(2 Pt 
l):474-8). 
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In one aspect, the invention provides methods for monitoring 121P1F1 gene products by determining the 
status of 121 PI Fl gene products expressed by cells from an individual suspected of having a disease associated 
with dysregulated cell growth (such as hyperplasia or cancer) and then comparing the status so determined to the 
status of 121 PI Fl gene products in a corresponding normal sample. The presence of aberrant 12 1 P 1 Fl gene 
5 products in the test sample relative to the normal sample provides an indication of the presence of dysregulated cell 
growth within the cells of the individual. 

In another aspect, the invention provides assays useful in determining the presence of cancer in an 
individual, comprising detecting a significant increase in 12 1 PI Fl mRNA or protein expression in a test cell or 
tissue sample relative to expression levels in the corresponding normal cell or tissue. The presence of 121 PI Fl 
10 mRNA can, for example, be evaluated in tissue samples including but not limited to those listed in Table I. The 
presence of significant 121P1F1 expression in any of these tissues is useful to indicate the emergence, presence 
and/or severity of a cancer, since the corresponding normal tissues do not express 121P1F1 mRNA or express it at 
lower levels. 

In a related embodiment, 121P1F1 status is determined at the protein level rather than at the nucleic acid level. 

15 For example, such a method comprises determining the level of 121P1F1 protein expressed by cells in a test tissue 

sample and comparing the level so determined to the level of 1 2 1 P 1 F 1 expressed in a corresponding normal sample. In 
one embodiment, the presence of 121 PI Fl protein is evaluated, for example, using immunohistochemical methods. 
1 2 1 P 1 F 1 antibodies or binding partners capable of detecting 1 2 1 P 1 F 1 protein expression are used in a variety of assay 
formats well known in the art for this purpose. 

20 In a further embodiment, one can evaluate the status of 1 2 1 P 1 F 1 nucleotide and amino acid sequences in a 

biological sample in order to identify perturbations in the structure of these molecules. These perturbations can include 
insertions, deletions, substitutions and the like. Such evaluations are useful because perturbations in the nucleotide and 
amino acid sequences are observed in a large number of proteins associated with a growth dysregulated phenotype (see, 
e.g., Marrogi et al. 9 1999, J. Cutan. Pathol. 26(8):369-378). For example, a mutation in the sequence of 121 PI Fl may 

25 be indicative of the presence or promotion of a tumor. Such assays therefore have diagnostic and predictive value where 
a mutation in 121P1F1 indicates a potential loss of function or increase in tumor growth. 

A wide variety of assays for observing perturbations in nucleotide and amino acid sequences are well known in 
the art. For example, the size and structure of nucleic acid or amino acid sequences of 121 PI Fl gene products are 
observed by the Northern, Southern, Western, PCR and DN A sequencing protocols discussed herein. In addition, other 

30 methods for observing perturbations in nucleotide and amino acid sequences such as single strand conformation 

polymorphism analysis are well known in the art (see, e.g., U.S. Patent Nos. 5,382,510 issued 7 September 1999, and 
5,952,170 issued 17 January 1995). 

Additionally, one can examine the methylation status of a 1 2 1 P 1 F 1 gene in a biological sample. Aberrant 
demethylation and/or hypermethylation of CpG islands in gene 5' regulatory regions frequently occurs in immortalized 

35 and transformed cells, and can result in altered expression of various genes. For example, promoter hypermethylation 
of the pi-class glutathione S-transferase (a protein expressed in normal prostate but not expressed in >90% of 
prostate carcinomas) appears to permanently silence transcription of this gene and is the most frequently detected 
genomic alteration in prostate carcinomas (De Marzo et a!., Am. J. Pathol. 155(6): 1985-1992 (1999)). In addition, 
this alteration is present in at least 70% of cases of high-grade prostatic intraepithelial neoplasia (PIN) (Brooks et 

40 a/., Cancer Epidemiol. Biomarkers Prev., 1 998, 7:53 1 -536). In another example, expression of the LAGE-I tumor 
specific gene (which is not expressed in normal prostate but is expressed in 25-50% of prostate cancers) is induced 
by deoxy-azacytidine in lymphoblastoid cells, suggesting that tumoral expression is due to demethylation (Lethe et 
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al. 9 Int. J. Cancer 76(6): 903-908 (1998)). A variety of assays for examining methylation status of a gene are well 
known in the art. For example, one can utilize, in Southern hybridization approaches, methylation-sensitive restriction 
enzymes that cannot cleave sequences that contain methylated CpG sites to assess the methylation status of CpG islands. 
In addition, MSP (methylation specific PCR) can rapidly profile the methylation status of all the CpG sites present in a 
5 CpG island of a given gene. This procedure involves initial modification of DNA by sodium bisulfite (which will 
convert all unmethylated cytosines to uracil) followed by amplification using primers specific for methylated versus 
unmethylated DNA. Protocols involving methylation interference can also be found for example in Current Protocols 
In Molecular Biology, Unit 12, Frederick M. Ausubel et al eds., 1995. 

Gene amplification is an additional method for assessing the status of 121 PI FL Gene amplification is 

10 measured in a sample directly, for example, by conventional Southern blotting or Northern blotting to quantitate the 
transcription of mRNA (Thomas, 1980, Proc. Natl. Acad. Sci. USA, 77:5201-5205), dot blotting (DNA analysis), or 
in situ hybridization, using an appropriately labeled probe, based on the sequences provided herein. Alternatively, 
antibodies are employed that recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA 
hybrid duplexes or DNA-protein duplexes. The antibodies in turn are labeled and the assay carried out where the 

1 5 duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to 
the duplex can be detected. 

Biopsied tissue or peripheral blood can be conveniently assayed for the presence of cancer cells using for 
example, Northern, dot blot or RT-PCR analysis to detect 121P1F1 expression. The presence of RT-PCR amplifiable 
121P1F1 mRNA provides an indication of the presence of cancer. RT-PCR assays are well known in the art. RT-PCR 

20 detection assays for tumor cells in peripheral blood are currently being evaluated for use in the diagnosis and 

management of a number of human solid tumors. In the prostate cancer field, these include RT-PCR assays for the 
detection of cells expressing PSA and PSM (Verkaik et al, 1997, Urol. Res. 25:373-384; Ghossein et al, 1995, J. Clin. 
Oncol. 13:1195-2000; Hestone/a/., 1995, Clin. Chem. 41 : 1687- 1688). 

A further aspect of the invention is an assessment of the susceptibility that an individual has for developing 

25 cancer. In one embodiment, a method for predicting susceptibility to cancer comprises detecting 121P1F1 mRNA or 
121P1F1 protein in a tissue sample, its presence indicating susceptibility to cancer, wherein the degree of 121P1F1 
mRNA expression correlates to the degree of susceptibility. In a specific embodiment, the presence of 121 PI Fl in 
prostate or other tissue is examined, with the presence of 121 PI Fl in the sample providing an indication of prostate 
cancer susceptibility (or the emergence or existence of a prostate tumor). Similarly, one can evaluate the integrity 

30 121P1 Fl nucleotide and amino acid sequences in a biological sample, in order to identify perturbations in the structure 
of these molecules such as insertions, deletions, substitutions and the like. The presence of one or more perturbations in 
121 PI Fl gene products in the sample is an indication of cancer susceptibility (or the emergence or existence of a 
tumor). 

The invention also comprises methods for gauging tumor aggressiveness. In one embodiment, a method for 
35 gauging aggressiveness of a tumor comprises determining the level of 121 PI Fl mRNA or 121 PI Fl protein expressed 
by tumor cells, comparing the level so determined to the level of 121 PI Fl mRNA or 121P1F1 protein expressed in a 
corresponding normal tissue taken from the same individual or a normal tissue reference sample, wherein the degree of 
1 2 1 P 1 F 1 mRNA or 1 2 1 P 1 F 1 protein expression in the tumor sample relative to the normal sample indicates the degree 
of aggressiveness. In a specific embodiment, aggressiveness of a tumor is evaluated by determining the extent to which 
40 1 2 1 P 1 F 1 is expressed in the tumor cells, with higher expression levels indicating more aggressive tumors. Another 

embodiment is the evaluation of the integrity of 121P1F1 nucleotide and amino acid sequences in a biological sample, in 
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order to identify perturbations in the structure of these molecules such as insertions, deletions, substitutions and the like. 
The presence of one or more perturbations indicates more aggressive tumors. 

Another embodiment of the invention is directed to methods for observing the progression of a malignancy in 
an individual over time. In one embodiment, methods for observing the progression of a malignancy in an individual 
5 overtime comprise determining the level of 12 IP 1F1 mRNA or 121 PI Fl protein expressed by cells in a sample of the 
tumor, comparing the level so determined to the level of 12 1 PI F 1 mRNA or 1 2 1 P 1 F 1 protein expressed in an 
equivalent tissue sample taken from the same individual at a different time, wherein the degree of 121 PI Fl mRNA or 
121P1F1 protein expression in the tumor sample over time provides information on the progression of the cancer. In a 
specific embodiment, the progression of a cancer is evaluated by determining 121P1F1 expression in the tumor cells 

1 0 over time, where increased expression over time indicates a progression of the cancer. Also, one can evaluate the 

integrity 121P1F1 nucleotide and amino acid sequences in a biological sample in order to identify perturbations in the 
structure of these molecules such as insertions, deletions, substitutions and the like, where the presence of one or more 
perturbations indicates a progression of the cancer. 

The above diagnostic approaches can be combined with any one of a wide variety of prognostic and diagnostic 

1 5 protocols known in the art. For example, another embodiment of the invention is directed to methods for observing a 
coincidence between the expression of 1 2 1 P 1 F 1 gene and 1 2 1 P 1 F 1 gene products (or perturbations in 1 2 1 P 1 F 1 gene 
and 121P1F1 gene products) and a factor that is associated with malignancy, as a means for diagnosing and 
prognosticating the status of a tissue sample. A wide variety of factors associated with malignancy can be utilized, such 
as the expression of genes associated with malignancy (e.g. PSA, PSCA and PSM expression for prostate cancer etc.) as 

20 well as gross cytological observations (see, e.g., Bocking et al. y 1984, Anal. Quant. Cytol. 6(2):74-88; Epstein, 1995, 
Hum. Pathol. 26(2):223-9; Thorson et ai, 1998, Mod. Pathol. 1 1(6):543-51 ; Baisden et al. y 1999, Am. J. Surg. 
Pathol. 23(8):9 18-24). Methods for observing a coincidence between the expression of 121 PI Fl gene and 121 PI Fl 
gene products (or perturbations in 121 PI Fl gene and 121 PI Fl gene products) and another factor that is associated with 
malignancy are useful, for example, because the presence of a set of specific factors that coincide with disease provides 

25 information crucial for diagnosing and prognosticating the status of a tissue sample. 

In one embodiment, methods for observing a coincidence between the expression of 121 PI Fl gene and 
1 2 1 P 1 F 1 gene products (or perturbations in 1 2 1 P 1 F 1 gene and 1 2 1 P 1 F 1 gene products) and another factor associated 
with malignancy entails detecting the overexpression of 121P1F1 mRNA or protein in a tissue sample, detecting the 
overexpression of PSA mRNA or protein in a tissue sample (or PSCA or PSM expression), and observing a coincidence 

30 of 121P1F1 mRNA or protein and PSA mRNA or protein overexpression (or PSCA or PSM expression). In a specific 
embodiment, the expression of 121 PI Fl and PSA mRNA in prostate tissue is examined, where the coincidence of 
121P1F1 and PSA mRNA overexpression in the sample indicates the existence of prostate cancer, prostate cancer 
susceptibility or the emergence or status of a prostate tumor. 

Methods for detecting and quantifying the expression of 1 2 1 P 1 F 1 mRNA or protein are described herein, and 

35 standard nucleic acid and protein detection and quantification technologies are well known in the art. Standard methods 
for the detection and quantification of 121 PI Fl mRNA include in situ hybridization using labeled 121 PI Fl riboprobes, 
Northern blot and related techniques using 121P1F1 polynucleotide probes, RT-PCR analysis using primers specific for 
1 2 1 P I F 1 , and other amplification type detection methods, such as, for example, branched DN A, SISBA, TMA and the 
like. In a specific embodiment, semi-quantitative RT-PCR is used to detect and quantify 121P1F1 mRNA expression. 

40 Any number of primers capable of amplifying 121 PI Fl can be used for this purpose, including but not limited to the 
various primer sets specifically described herein. In a specific embodiment, polyclonal or monoclonal antibodies 
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specifically reactive with the wild-type 121P1F1 protein can be used in an immunohistochemical assay of biopsied 
tissue. 

IX.) Identification of Molecules That Interact With 121P1F1 
5 The 121P1F1 protein and nucleic acid sequences disclosed herein allow a skilled artisan to identify proteins, 

small molecules and other agents that interact with 121P1F1, as well as pathways activated by 121 PI Fl via any one 
of a variety of art accepted protocols. For example, one can utilize one of the so-called interaction trap systems 
(also referred to as the "two-hybrid assay"). In such systems, molecules interact and reconstitute a transcription 
factor which directs expression of a reporter gene, whereupon the expression of the reporter gene is assayed. Other 
10 systems identify protein-protein interactions in vivo through reconstitution of a eukaryotic transcriptional activator, 
see, e.g., U.S. Patent Nos. 5,955,280 issued 21 September 1999, 5,925,523 issued 20 July 1999, 5,846,722 issued 8 
December 1998 and 6,004,746 issued 21 December 1999. Algorithms are also available in the art for genome- 
based predictions of protein function (see, e.g., Marcotte, et al, Nature 402: 4 November 1999, 83-86). 

Alternatively one can screen peptide libraries to identify molecules that interact with 121P1F1 protein 
15 sequences. In such methods, peptides that bind to 121 PI Fl are identified by screening libraries that encode a 

random or controlled collection of amino acids. Peptides encoded by the libraries are expressed as fusion proteins 
of bacteriophage coat proteins, the bacteriophage particles are then screened against the 121P1F1 protein(s). 

Accordingly, peptides having a wide variety of uses, such as therapeutic, prognostic or diagnostic reagents, 
are thus identified without any prior information on the structure of the expected ligand or receptor molecule. 
20 Typical peptide libraries and screening methods that can be used to identify molecules that interact with 121P1F1 
protein sequences are disclosed for example in U.S. Patent Nos. 5,723,286 issued 3 March 1998 and 5,733,731 
issued 31 March 1998. 

Alternatively, cell lines that express 121P1F1 are used to identify protein -protein interactions mediated by 
121P1F1. Such interactions can be examined using immunoprecipitation techniques (see, e.g., Hamilton B.J., et al 

25 Biochem. Biophys. Res. Commun. 1999, 261:646-51). 121P1F1 protein can be immunoprecipitated from 121P1F1- 
expressing cell lines using anti-121PlFl antibodies. Alternatively, antibodies against His-tag can be used in a cell 
line engineered to express fusions of 1 2 1 P 1 F 1 and a His-tag (vectors mentioned above). The immunoprecipitated 
complex can be examined for protein association by procedures such as Western blotting, 35 S-methionine labeling 
of proteins, protein microsequencing, silver staining and two-dimensional gel electrophoresis. 

30 Small molecules and ligands that interact with 121P1F1 can be identified through related embodiments of 

such screening assays. For example, small molecules can be identified that interfere with protein function, 
including molecules that interfere with 121P1FTS ability to mediate phosphorylation and de-phosphorylation, 
interaction with DN A or RN A molecules as an indication of regulation of cell cycles, second messenger signaling 
or tumorigenesis. Similarly, small molecules that modulate 121 PI Fl -related ion channel, protein pump, or cell 

35 communication functions are identified and used to treat patients that have a cancer that expresses 121P1F1 (see, 

e.g., Hille, B., Ionic Channels of Excitable Membranes 2 nd Ed., Sinauer Assoc., Sunderland, MA, 1992). Moreover, 
ligands that regulate 121P1F1 function can be identified based on their ability to bind 12IP1F1 and activate a 
reporter construct. Typical methods are discussed for example in U.S. Patent No. 5,928,868 issued 27 July 1999, 
and include methods for forming hybrid ligands in which at least one ligand is a small molecule. In an illustrative 

40 embodiment, cells engineered to express a fusion protein of 121 PI Fl and a DNA-binding protein are used to co- 
express a fusion protein of a hybrid ligand/small molecule and a cDNA library transcriptional activator protein. 
The cells further contain a reporter gene, the expression of which is conditioned on the proximity of the first and 
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second fusion proteins to each other, an event that occurs only if the hybrid ligand binds to target sites on both 
hybrid proteins. Those cells that express the reporter gene are selected and the unknown small molecule or the 
unknown ligand is identified. This method provides a means of identifying modulators which activate or inhibit 
121P1F1. 

5 An embodiment of this invention comprises a method of screening for a molecule that interacts with an 

121P1F1 amino acid sequence shown in Figure 2 or Figure 3, comprising the steps of contacting a population of 
molecules with a 1 2 1 P 1 F 1 amino acid sequence, allowing the population of molecules and the 1 2 1 P 1 F 1 amino acid 
sequence to interact under conditions that facilitate an interaction, determining the presence of a molecule that 
interacts with the 121P1F1 amino acid sequence, and then separating molecules that do not interact with the 
10 121 PI Fl amino acid sequence from molecules that do. In a specific embodiment, the method further comprises 
purifying, characterizing and identifying a molecule that interacts with the 121 PI Fl amino acid sequence. The 
identified molecule can be used to modulate a function performed by 121 PI Fl . In a preferred embodiment, the 
121P1F1 amino acid sequence is contacted with a library of peptides. 

15 X.) Therapeutic Methods and Compositions 

The identification of 121 PI Fl as a protein that is normally expressed in a restricted set of tissues, but 
which is also expressed in prostate and other cancers, opens a number of therapeutic approaches to the treatment of 
such cancers. As contemplated herein, 121 PI Fl functions as a transcription factor involved in activating tumor- 
promoting genes or repressing genes that block tumorigenesis. 

20 Accordingly, therapeutic approaches that inhibit the activity of a 121 P1F1 protein are useful for patients 

suffering from a cancer that expresses 12 1 PI Fl . These therapeutic approaches generally fall into two classes. One 
class comprises various methods for inhibiting the binding or association of a 121P1F1 protein with its binding 
partner or with other proteins. Another class comprises a variety of methods for inhibiting the transcription of a 
121P1F1 gene or translation of 121P1F1 mRNA. 

25 X.A.) Anti-Cancer Vaccines 

The invention provides cancer vaccines comprising a 121PlFl-related protein or 121PlFl-related nucleic acid. 
In view of the expression of 12 1 PI Fl, cancer vaccines prevent and/or treat 121 PI Fl -expressing cancers with minimal 
or no effects on non-target tissues. The use of a tumor antigen in a vaccine that generates humoral and/or cell-mediated 
immune responses as anti-cancer therapy is well known in the art and has been employed in prostate cancer using 

30 human PSMA and rodent PAP immunogens (Hodge et al., 1995, Int. J. Cancer 63:231-237; Fong et al. y 1997, J. 
Immunol. 159:3113-3117). 

Such methods can be readily practiced by employing a 121PlFl-reIated protein, or an 121PlFl-encoding 
nucleic acid molecule and recombinant vectors capable of expressing and presenting the 121 PI Fl immunogen 
(which typically comprises a number of antibody or T cell epitopes). Skilled artisans understand that a wide variety 

35 of vaccine systems for delivery of immunoreactive epitopes are known in the art (see, e.g., Heryln et aL y Ann Med 
1999 Feb 3 l(l):66-78; Maruyama et al, Cancer Immunol Immunother 2000 Jun 49(3): 123-32) Briefly, such 
methods of generating an immune response (e.g. humoral and/or cell-mediated) in a mammal, comprise the steps of: 
exposing the mammal's immune system to an immunoreactive epitope (e.g. an epitope present in a 121 PI Fl protein 
shown in Figure 3 or analog or homolog thereof) so that the mammal generates an immune response that is specific 

40 for that epitope (e.g. generates antibodies that specifically recognize that epitope). In a preferred method, a 

121P1F1 immunogen contains a biological motif, see e.g., Tables V-XVIII, XXVI, and XXVII, or a peptide of a 
size range from 121P1FI indicated in Figure 5, Figure 6, Figure 7, Figure 8, and Figure 9. 
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The entire 121P1F1 protein, immunogenic regions or epitopes thereof can be combined and delivered by 
various means. Such vaccine compositions can include, for example, lipopeptides (eg.,Vitiello, A. et al., J. Clin. 
Invest. 95:341, 1995), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG") microspheres 
(see, e.g., Eldridge, et al., Molec. Immunol. 28:287-294, 1991: Alonso et al, Vaccine 12:299-306, 1994; Jones et 
5 al, Vaccine 13:675-681, 1995), peptide compositions contained in immune stimulating complexes (ISCOMS) (see, 
e.g., Takahashi et ai, Nature 344:873-875, 1990; Hu et al, Clin Exp Immunol. 1 13:235-243, 1998), multiple 
antigen peptide systems (MAPs) (see e.g., Tarn, J. P., Proc. Natl. Acad Sci. U.S.A. 85:5409-5413, 1988; Tarn, J.P., 
J. Immunol. Methods 196:17-32, 1996), peptides formulated as multivalent peptides; peptides for use in ballistic 
delivery systems, typically crystallized peptides, viral delivery vectors (Perkus, M. E. et al, In: Concepts in vaccine 

10 development, Kaufmann, S. H. E., ed., p. 379, 1996; Chakrabarti, S. et al., Nature 320:535, 1986; Hu, S. L. et al., 
Nature 320:537, 1986; Kieny, M.-P. et ai, AIDS Bio/Technology 4:790, 1986; Top, F. H. et al., J. Infect. Dis. 
124:148, 1971; Chanda, P. K. et ai, Virology 175:535, 1990), particles of viral or synthetic origin (e.g., Kofler, N. 
et ai, J. Immunol. Methods. 192:25, 1996; Eldridge, J. H. et ai, Sem. Hematol. 30:16, 1993; Falo, L. D., Jr. et ai, 
Nature Med. 7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol. 4:369, 

15 1986; Gupta, R. K. et ai, Vaccine 1 1:293, 1993), liposomes (Reddy, R. etal.,J. Immunol. 148:1585, 1992; Rock, 
K. L., Immunol. Today 17:131, 1996), or, naked or particle absorbed cDNA (Ulmer, J. B. et al., Science 259:1745, 
1993; Robinson, H. L., Hunt, L. A., and Webster, R. G., Vaccine 1 1:957, 1993; Shiver, J. W. et ai, In: Concepts in 
vaccine development, Kaufmann, S. H. E., ed., p. 423, 1996; Cease, K. B., and Berzofsky, J. A., Annu. Rev. 
Immunol. 12:923, 1994 and Eldridge, J. H. et ai, Sem. Hematol. 30:16, 1993). Toxin-targeted delivery 

20 technologies, also known as receptor mediated targeting, such as those of Avant Immunotherapeutics, Inc. 
(Needham, Massachusetts) may also be used. 

In patients with 121 PI Fl -associated cancer, the vaccine compositions of the invention can also be used in 
conjunction with other treatments used for cancer, e.g., surgery, chemotherapy, drug therapies, radiation therapies, 
etc. including use in combination with immune adjuvants such as IL-2, IL-12, GM-CSF, and the like. 

25 Cellular Vaccines: 

CTL epitopes can be determined using specific algorithms to identify peptides within 121P1F1 protein that 
bind corresponding HLA alleles (see e.g., Table IV; Epimer™ and Epimatrix™, Brown University (TJRL located on the 
World Wide Web at w^vw.brown.edu/Research/TB-HIV_Lab/epimatrix/epmiatrix.htmO and, BIMAS, (URL 
bimas.dcrt.nih.gov/; SYFPEITHI at URL syfpeithi.bmi-heidelberg.com/). In a preferred embodiment, a 121P1F1 

30 immunogen contains one or more amino acid sequences identified using techniques well known in the art, such as the 
sequences shown in Tables V-XVIII, XXVI, and XXVII or a peptide of 8, 9, 10 or 1 1 amino acids specified by an 
HLA Class I motif/supermotif (e.g., Table IV (A), Table IV (D), or Table IV (E)) and/or a peptide of at least 9 
amino acids that comprises an HLA Class II motif/supermotif (e.g., Table IV (B) or Table IV (C)). As is 
appreciated in the art, the HLA Class I binding groove is essentially closed ended so that peptides of only a 

35 particular size range can fit into the groove and be bound, generally HLA Class I epitopes are 8, 9, 10, or 1 1 amino 
acids long. In contrast, the HLA Class II binding groove is essentially open ended; therefore a peptide of about 9 or 
more amino acids can be bound by an HLA Class II molecule. Due to the binding groove differences between HLA 
Class I and II, HLA Class I motifs are length specific, i.e., position two of a Class I motif is the second amino acid 
in an amino to carboxyl direction of the peptide. The amino acid positions in a Class II motif are relative only to 

40 each other, not the overall peptide, i.e., additional amino acids can be attached to the amino and/or carboxyl termini 
of a motif-bearing sequence. HLA Class II epitopes are often 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, or 25 amino acids long, or longer than 25 amino acids. 
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Antibody-based Vaccines 

A wide variety of methods for generating an immune response in a mammal are known in the art (for 
example as the first step in the generation of hybridomas). Methods of generating an immune response in a 
mammal comprise exposing the mammal's immune system to an immunogenic epitope on a protein (e.g. a 121 PI Fl 
5 protein) so that an immune response is generated. A typical embodiment consists of a method for generating an 
immune response to 121P1F1 in a host, by contacting the host with a sufficient amount of at least one 12 1 PI Fl B 
cell or cytotoxic T-cell epitope or analog thereof; and at least one periodic interval thereafter re-contacting the host 
with the 121P1F1 B cell or cytotoxic T-cell epitope or analog thereof. A specific embodiment consists of a method 
of generating an immune response against a 121 PI Fl -related protein or a man-made multiepitopic peptide 

1 0 comprising: administering 12 1 PI Fl immunogen (e.g. a 1 2 1 P 1 F 1 protein or a peptide fragment thereof, an 1 2 1 P 1 F 1 
fusion protein or analog etc.) in a vaccine preparation to a human or another mammal. Typically, such vaccine 
preparations further contain a suitable adjuvant (see, e.g., U.S. Patent No. 6,146,635) or a universal helper epitope 
such as a PADRE™ peptide (Epimmune Inc., San Diego, CA; see, e.g., Alexander et aL, J. Immunol. 2000 164(3); 
164(3): 1625-1633; Alexanders aL, Immunity 1994 1(9): 751-761 and Alexander et ai, Immunol. Res. 1998 

15 18(2): 79-92). An alternative method comprises generating an immune response in an individual against a 121P1F1 
immunogen by: administering in vivo to muscle or skin of the individuals body a DNA molecule that comprises a 
DNA sequence that encodes an 121P1F1 immunogen, the DNA sequence operatively linked to regulatory sequences 
which control the expression of the DNA sequence; wherein the DNA molecule is taken up by cells, the DNA 
sequence is expressed in the cells and an immune response is generated against the immunogen (see, e.g., U.S. 

20 Patent No. 5,962,428). Optionally a genetic vaccine facilitator such as anionic lipids; saponins; lectins; estrogenic 
compounds; hydroxylated lower alkyls; dimethyl sulfoxide; and urea is also administered. In addition, an 
antiidiotype antibody can be administered that mimics 121P1F1, in order to generate a response to the target 
antigen. 

Nucleic Acid Vaccines: 

25 Vaccine compositions of the invention include nucleic acid-mediated modalities. DNA or RNA that 

encode protein(s) of the invention can be administered to a patient. Genetic immunization methods can be 
employed to generate prophylactic or therapeutic humoral and cellular immune responses directed against cancer 
cells expressing 121P1F1. Constructs comprising DNAencodinga 121 PI Fl -related protein/immunogen and 
appropriate regulatory sequences can be injected directly into muscle or skin of an individual, such that the cells of 

30 the muscle or skin take-up the construct and express the encoded 121P1F1 protein/immunogen. Alternatively, a 

vaccine comprises a 121PlFl-reIated protein. Expression of the 121PlFl-related protein immunogen results in the 
generation of prophylactic or therapeutic humoral and cellular immunity against cells that bear a 121 PI Fl protein. 
Various prophylactic and therapeutic genetic immunization techniques known in the art can be used (for review, see 
information and references published at Internet address located on the W orld Wide Web at www.genweb.com). 

35 Nucleic acid-based delivery is described, for instance, in Wolff et. ai, Science 247: 1465 (1990) as well as U.S. 

Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,1 18; 5,736,524; 5,679,647; WO 98/04720. Examples of DNA- 
based delivery technologies include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery (see, e.g., U.S. Patent 
No. 5,922,687). 

40 For therapeutic or prophylactic immunization purposes, proteins of the invention can be expressed via viral 

or bacterial vectors. Various viral gene delivery systems that can be used in the practice of the invention include, but 
are not limited to, vaccinia, fowlpox, canarypox, adenovirus, influenza, poliovirus, adeno-associated virus, lentivirus, 

40 
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and sindbis virus (see, e.g., Restifo, 1996, Curr. Opin. Immunol. 8:658-663; Tsang et al J. Natl. Cancer Inst. 87:982-990 
(1995)). Non- viral delivery systems can also be employed by introducing naked DNA encoding a 121 PI Fl -related 
protein into the patient (e.g., intramuscularly or intradermal ly) to induce an anti-tumor response. 

Vaccinia virus is used, for example, as a vector to express nucleotide sequences that encode the peptides of 
5 the invention. Upon introduction into a host, the recombinant vaccinia virus expresses the protein immunogenic 
peptide, and thereby elicits a host immune response. Vaccinia vectors and methods useful in immunization 
protocols are described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). 
BCG vectors are described in Stover et al., Nature 351:456-460 (1991). A wide variety of other vectors useful for 
therapeutic administration or immunization of the peptides of the invention, e.g. adeno and adeno-associated virus 

10 vectors, retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, will be apparent 
to those skilled in the art from the description herein. 

Thus, gene delivery systems are used to deliver a 121 PI Fl -related nucleic acid molecule. In one embodiment, 
the full-length human 121P1F1 cDNA is employed. In another embodiment, 121P1F1 nucleic acid molecules encoding 
specific cytotoxic T lymphocyte (CTL) and/or antibody epitopes are employed. 

15 Ex Vivo Vaccines 

Various ex vivo strategies can also be employed to generate an immune response. One approach involves the 
use of antigen presenting cells (APCs) such as dendritic cells (DC) to present 121 PI Fl antigen to a patient's immune 
system. Dendritic cells express MHC class I and II molecules, B7 co-stimulator, and IL-12, and are thus highly 
specialized antigen presenting cells. In prostate cancer, autologous dendritic cells pulsed with peptides of the 

20 prostate-specific membrane antigen (PSMA) are being used in a Phase I clinical trial to stimulate prostate cancer 
patients' immune systems (Tjoa et al., 1996, Prostate 28:65-69; Murphy et al., 1996, Prostate 29:371-380). Thus, 
dendritic cells can be used to present 121 PI Fl peptides to T cells in the context of MHC class I or II molecules. In 
one embodiment, autologous dendritic cells are pulsed with 121P1F1 peptides capable of binding to MHC class I 
and/or class II molecules. In another embodiment, dendritic cells are pulsed with the complete 121P1F1 protein. 

25 Yet another embodiment involves engineering the overexpression of a 121 PI Fl gene in dendritic cells using 

various implementing vectors known in the art, such as adenovirus (Arthur et al., 1997, Cancer Gene Ther. 4:17- 
25), retrovirus (Henderson et al., 1996, Cancer Res. 56:3763-3770), lentivirus, adeno-associated virus, DNA 
transfection (Ribas et al., 1997, Cancer Res. 57:2865-2869), or tumor-derived RNA transfection (Ashley et al., 
1997, J. Exp. Med. 186:1 177-1 182). Cells that express 121P1F1 can also be engineered to express immune 

30 modulators, such as GM-CSF, and used as immunizing agents. 

X.B.) 121P1F1 as a Target for Antibodv-based Therapy 

121P1F1 is an attractive target for antibody-based therapeutic strategies. A number of antibody strategies 
are known in the art for targeting both extracellular and intracellular molecules (see, e.g., complement and ADCC 
mediated killing as well as the use of intrabodies). Because 1 2 1 P 1 F 1 is expressed by cancer cells of various 

35 lineages relative to corresponding normal cells, systemic administration of 121PlFl-immunoreactive compositions 
are prepared that exhibit excellent sensitivity without toxic, non-specific and/or non-target effects caused by binding 
of the immunoreactive composition to non-target organs and tissues. Antibodies specifically reactive with domains 
of 121 PI Fl are useful to treat 1 2 1 P 1 F 1 -expressing cancers systemically, either as conjugates with a toxin or 
therapeutic agent, or as naked antibodies capable of inhibiting cell proliferation or function. 

40 1 2 1 P 1 F 1 antibodies can be introduced into a patient such that the antibody binds to 1 2 1 P 1 F 1 and 

modulates a function, such as an interaction with a binding partner, and consequently mediates destruction of the 
tumor cells and/or inhibits the growth of the tumor cells. Mechanisms by which such antibodies exert a therapeutic 
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effect can include complement-mediated cytolysis, antibody-dependent cellular cytotoxicity, modulation of the 
physiological function of 121 PI Fl, inhibition of ligand binding or signal transduction pathways, modulation of 
tumor cell differentiation, alteration of tumor angiogenesis factor profiles, and/or apoptosis. 

Those skilled in the art understand that antibodies can be used to specifically target and bind immunogenic 
5 molecules such as an immunogenic region of a 12 1P1 Fl sequence shown in Figure 2 or Figure 3. In addition, 
skilled artisans understand that it is routine to conjugate antibodies to cytotoxic agents (see, e.g., Slevers et ai 
Blood 93: 1 1 3678-3684 (June 1 , 1999)). When cytotoxic and/or therapeutic agents are delivered directly to cells, 
such as by conjugating them to antibodies specific for a molecule expressed by that cell (e.g. 121P1F1), the 
cytotoxic agent will exert its known biological effect (i.e. cytotoxicity) on those cells. 

10 A wide variety of compositions and methods for using antibody-cytotoxic agent conjugates to kill cells are 

known in the art. In the context of cancers, typical methods entail administering to an animal having a tumor a 
biologically effective amount of a conjugate comprising a selected cytotoxic and/or therapeutic agent linked to a 
targeting agent (e.g. an anti- 1 2 1 P 1 F 1 antibody) that binds to a marker (e.g. 1 2 1 P 1 F 1 ) expressed, accessible to 
binding or localized on the cell surfaces. A typical embodiment is a method of delivering a cytotoxic and/or 

15 therapeutic agent to a cell expressing 121P1F1, comprising conjugating the cytotoxic agent to an antibody that 

immunospeciflcally binds to a 121 PI Fl epitope, and, exposing the cell to the antibody-agent conjugate. Another 
illustrative embodiment is a method of treating an individual suspected of suffering from metastasized cancer, 
comprising a step of administering parenterally to said individual a pharmaceutical composition comprising a 
therapeutically effective amount of an antibody conjugated to a cytotoxic and/or therapeutic agent. 

20 Cancer immunotherapy using anti- 1 2 1 P 1 F 1 antibodies can be done in accordance with various approaches 

that have been successfully employed in the treatment of other types of cancer, including but not limited to colon 
cancer (Arlen et ai, 1998, Crit. Rev. Immunol. 18:133-138), multiple myeloma (Ozaki et aL, 1997, Blood 90:3179- 
3 186, Tsunenari et aL, 1997, Blood 90:2437-2444), gastric cancer (Kasprzyk et al y 1992, Cancer Res. 52:2771- 
2776), B-cell lymphoma (Funakoshi et al, 1996, J. Immunother. Emphasis Tumor Immunol. 19:93-101), leukemia 

25 (Zhong et al. 9 1996, Leuk. Res. 20:581-589), colorectal cancer (Moun et ai, 1994, Cancer Res. 54:6160-6166; 

Velderse/a/., 1995, Cancer Res. 55:4398-4403), and breast cancer (Shepard et al., 1991, J. Clin. Immunol. 11:117- 
127). Some therapeutic approaches involve conjugation of naked antibody to a toxin or radioisotope, such as the 
conjugation of Y 9i or I 131 to anti-CD20 antibodies (e.g., Zevalin™, IDEC Pharmaceuticals Corp. or Bexxar™, 
Coulter Pharmaceuticals), while others involve co-administration of antibodies and other therapeutic agents, such as 

30 Herceptin™ (trastuzumab) with paclitaxel (Genentech, Inc.). The antibodies can be conjugated to a therapeutic 

agent. To treat prostate cancer, for example, 121 PI Fl antibodies can be administered in conjunction with radiation, 
chemotherapy or hormone ablation. Also, antibodies can be conjugated to a toxin such as calicheamicin (e.g., 
Mylotarg™, Wyeth-Ayerst, Madison, NJ, a recombinant humanized IgG 4 kappa antibody conjugated to antitumor 
antibiotic calicheamicin) or a maytansinoid (e.g., taxane-based Tumor-Activated Prodrug, TAP, platform, 

35 ImmunoGen, Cambridge, MA, also see e.g., US Patent 5,416,064). 

Although 121 PI Fl antibody therapy is useful for all stages of cancer, antibody therapy can be particularly 
appropriate in advanced or metastatic cancers. Treatment with the antibody therapy of the invention is indicated for 
patients who have received one or more rounds of chemotherapy. Alternatively, antibody therapy of the invention 
is combined with a chemotherapeutic or radiation regimen for patients who have not received chemotherapeutic 

40 treatment. Additionally, antibody therapy can enable the use of reduced dosages of concomitant chemotherapy, 

particularly for patients who do not tolerate the toxicity of the chemotherapeutic agent very well. Fan et al. (Cancer 
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Res. 53:4637-4642, 1993), Prewett et al. (International J. of Onco. 9:217-224, 1996), and Hancock et al. (Cancer 
Res. 51:4575-4580, 1991) describe the use of various antibodies together with chemotherapeutic agents. 

Although 121P1F1 antibody therapy is useful for all stages of cancer, antibody therapy can be particularly 
appropriate in advanced or metastatic cancers. Treatment with the antibody therapy of the invention is indicated for 
5 patients who have received one or more rounds of chemotherapy. Alternatively, antibody therapy of the invention 
is combined with a chemotherapeutic or radiation regimen for patients who have not received chemotherapeutic 
treatment. Additionally, antibody therapy can enable the use of reduced dosages of concomitant chemotherapy, 
particularly for patients who do not tolerate the toxicity of the chemotherapeutic agent very well. 

Cancer patients can be evaluated for the presence and level of 121 PI Fl expression, preferably using 
10 immunohistochemical assessments of tumor tissue, quantitative 12 1 PI Fl imaging, or other techniques that reliably 
indicate the presence and degree of 121 PI Fl expression. Immunohistochemical analysis of tumor biopsies or 
surgical specimens is preferred for this purpose. Methods for immunohistochemical analysis of tumor tissues are 
well known in the art. 

Anti-121P1F1 monoclonal antibodies that treat prostate and other cancers include those that initiate a 
1 5 potent immune response against the tumor or those that are directly cytotoxic. In this regard, anti- 1 2 1 P 1 F 1 

monoclonal antibodies (mAbs) can elicit tumor cell lysis by either complement-mediated or antibody-dependent 
cell cytotoxicity (ADCC) mechanisms, both of which require an intact Fc portion of the immunoglobulin molecule 
for interaction with effector cell Fc receptor sites on complement proteins. In addition, anti-121 PI Fl mAbs that 
exert a direct biological effect on tumor growth are useful to treat cancers that express 12 1 PI Fl , Mechanisms by 
20 which directly cytotoxic mAbs act include: inhibition of cell growth, modulation of cellular differentiation, 

modulation of tumor angiogenesis factor profiles, and the induction of apoptosis. The mechanism(s) by which a 
particular anti-121 PI Fl mAb exerts an anti-tumor effect is evaluated using any number of in vitro assays that 
evaluate cell death such as ADCC, ADMMC, complement-mediated cell lysis, and so forth, as is generally known 
in the art. 

25 In some patients, the use of murine or other non-human monoclonal antibodies, or human/mouse chimeric 

mAbs can induce moderate to strong immune responses against the non-human antibody. This can result in 
clearance of the antibody from circulation and reduced efficacy. In the most severe cases, such an immune 
response can lead to the extensive formation of immune complexes which, potentially, can cause renal failure. 
Accordingly, preferred monoclonal antibodies used in the therapeutic methods of the invention are those that are 

30 either fully human or humanized and that bind specifically to the target 121P1F1 antigen with high affinity but 
exhibit low or no antigenicity in the patient. 

Therapeutic methods of the invention contemplate the administration of single anti- 121 PI Fl mAbs as well 
as combinations, or cocktails, of different mAbs. Such mAb cocktails can have certain advantages inasmuch as 
they contain mAbs that target different epitopes, exploit different effector mechanisms or combine directly 

35 cytotoxic mAbs with mAbs that rely on immune effector functionality. Such mAbs in combination can exhibit 
synergistic therapeutic effects. In addition, anti-121 PI Fl mAbs can be administered concomitantly with other 
therapeutic modalities, including but not limited to various chemotherapeutic agents, androgen-blockers, immune 
modulators (e.g., 1L-2, GM-CSF), surgery or radiation. The anti-121 PI Fl mAbs are administered in their "naked" 
or unconjugated form, or can have a therapeutic agent(s) conjugated to them. 

40 Anti-121 P1F1 antibody formulations are administered via any route capable of delivering the antibodies to 

a tumor cell. Routes of administration include, but are not limited to, intravenous, intraperitoneal, intramuscular, 
intratumor, intradermal, and the like. Treatment generally involves repeated administration of the anti-121 PI Fl 
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antibody preparation, via an acceptable route of administration such as intravenous injection (IV), typically at a 
dose in the range of about 0.1, .2, .3, .4, .5, .6, .7, .8, .9., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, or 25 mg/kg body weight. 
In general, doses in the range of 10-1000 mg mAb per week are effective and well tolerated. 

Based on clinical experience with the Herceptin™ mAb in the treatment of metastatic breast cancer, an 
5 initial loading dose of approximately 4 mg/kg patient body weight IV, followed by weekly doses of about 2 mg/kg 
IV of the anti-121PlFl mAb preparation represents an acceptable dosing regimen. Preferably, the initial loading 
dose is administered as a 90 minute or longer infusion. The periodic maintenance dose is administered as a 30 
minute or longer infusion, provided the initial dose was well tolerated. As appreciated by those of skill in the art, 
various factors can influence the ideal dose regimen in a particular case. Such factors include, for example, the 
1 0 binding affinity and half life of the Ab or mAbs used, the degree of 1 2 1 P 1 F 1 expression in the patient, the extent of 
circulating shed 121P1F1 antigen, the desired steady-state antibody concentration level, frequency of treatment, and 
the influence of chemotherapeutic or other agents used in combination with the treatment method of the invention, 
as well as the health status of a particular patient. 

Optionally, patients should be evaluated for the levels of 121 PI Fl in a given sample (e.g. the levels of 
15 circulating 121P1F1 antigen and/or 121P1F1 expressing cells) in order to assist in the determination of the most 
effective dosing regimen, etc. Such evaluations are also used for monitoring purposes throughout therapy, and are 
useful to gauge therapeutic success in combination with the evaluation of other parameters (for example, urine 
cytology and/or ImmunoCyt levels in bladder cancer therapy, or by analogy, serum PSA levels in prostate cancer 
therapy). 

20 Anti-idiotypic anti-121PlFl antibodies can also be used in anti-cancer therapy as a vaccine for inducing an 

immune response to cells expressing a 121 PI Fl -related protein. In particular, the generation of anti-idiotypic 
antibodies is well known in the art; this methodology can readily be adapted to generate anti-idiotypic anti-121PlFl 
antibodies that mimic an epitope on a 121PlFl-related protein (see, for example, Wagner et aL, 1997, Hybridoma 
16: 33-40; Foon et al. y 1995, J. Clin. Invest. 96:334-342; Herlyn et al, 1996, Cancer Immunol. Immunother. 43:65- 

25 76). Such an anti-idiotypic antibody can be used in cancer vaccine strategies. 

XX.) 121P1F1 as a Target for Cellular Immune Responses 

Vaccines and methods of preparing vaccines that contain an immunogenically effective amount of one or 
more HLA-binding peptides as described herein are further embodiments of the invention. Furthermore, vaccines 
in accordance with the invention encompass compositions of one or more of the claimed peptides. A peptide can be 

30 present in a vaccine individually. Alternatively, the peptide can exist as a homopolymer comprising multiple copies 
of the same peptide, or as a heteropolymer of various peptides. Polymers have the advantage of increased 
immunological reaction and, where different peptide epitopes are used to make up the polymer, the additional 
ability to induce antibodies and/or CTLs that react with different antigenic determinants of the pathogenic organism 
or tumor-related peptide targeted for an immune response. The composition can be a naturally occurring region of 

35 an antigen or can be prepared, e.g., recombinantly or by chemical synthesis. 

Carriers that can be used with vaccines of the invention are well known in the art, and include, e.g., 
thyroglobulin, albumins such as human serum albumin, tetanus toxoid, polyamino acids such as poly L-lysine, poly 
L-glutamic acid, influenza, hepatitis B virus core protein, and the like. The vaccines can contain a physiologically 
tolerable (i.e., acceptable) diluent such as water, or saline, preferably phosphate buffered saline. The vaccines also 

40 typically include an adjuvant. Adjuvants such as incomplete Freund's adjuvant, aluminum phosphate, aluminum 
hydroxide, or alum are examples of materials well known in the art. Additionally, as disclosed herein, CTL 
responses can be primed by conjugating peptides of the invention to lipids, such as tripalmitoyl-S- 
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glycerylcysteinlyseryl- serine (P 3 CSS). Moreover, an adjuvant such as a synthetic cytosine-phosphorothiolated- 
guanine-containing (CpG) oligonucleotides has been found to increase CTL responses 10- to 100-fold, (see, e.g. 
Davila and Celis J. Immunol. 165:539-547 (2000)) 

Upon immunization with a peptide composition in accordance with the invention, via injection, aerosol, 
5 oral, transdermal, transmucosal, intrapleural, intrathecal, or other suitable routes, the immune system of the host 
responds to the vaccine by producing large amounts of CTLs and/or HTLs specific for the desired antigen. 
Consequently, the host becomes at least partially immune to later development of cells that express or overexpress 
121P1F1 antigen, or derives at least some therapeutic benefit when the antigen was tumor-associated. 

In some embodiments, it may be desirable to combine the class I peptide components with components 

10 that induce or facilitate neutralizing antibody and or helper T cell responses directed to the target antigen. A 
preferred embodiment of such a composition comprises class I and class II epitopes in accordance with the 
invention. An alternative embodiment of such a composition comprises a class I and/or class II epitope in 
accordance with the invention, along with a cross reactive HTL epitope such as PADRE™ (Epimmune, San Diego, 
CA) molecule (described e.g., in U.S. Patent Number 5,736,142). 

15 A vaccine of the invention can also include antigen-presenting cells (APC), such as dendritic cells (DC), as 

a vehicle to present peptides of the invention. Vaccine compositions can be created in vitro, following dendritic cell 
mobilization and harvesting, whereby loading of dendritic cells occurs in vitro. For example, dendritic cells are 
transfected, e.g., with a minigene in accordance with the invention, or are pulsed with peptides. The dendritic cell 
can then be administered to a patient to elicit immune responses in vivo. Vaccine compositions, either DN A- or 

20 peptide-based, can also be administered in vivo in combination with dendritic cell mobilization whereby loading of 
dendritic cells occurs in vivo. 

Preferably, the following principles are utilized when selecting an array of epitopes for inclusion in a 
polyepitopic composition for use in a vaccine, or for selecting discrete epitopes to be included in a vaccine and/or to 
be encoded by nucleic acids such as a minigene. It is preferred that each of the following principles be balanced in 

25 order to make the selection. The multiple epitopes to be incorporated in a given vaccine composition may be, but 
need not be, contiguous in sequence in the native antigen from which the epitopes are derived. 

1 . ) Epitopes are selected which, upon administration, mimic immune responses that have been 
observed to be correlated with tumor clearance. For HLA Class I this includes 3-4 epitopes that come from at least 
one tumor associated antigen (TAA). For HLA Class II a similar rationale is employed; again 3-4 epitopes are 

30 selected from at least one TAA (see, e.g., Rosenberg et aL, Science 278: 1447-1450). Epitopes from one TAA may 
be used in combination with epitopes from one or more additional TAAs to produce a vaccine that targets tumors 
with varying expression patterns of frequently-expressed TAAs. 

2. ) Epitopes are selected that have the requisite binding affinity established to be correlated with 
immunogenicity: for HLA Class I an IC 50 of 500 nM or less, often 200 nM or less; and for Class II an IC 50 of 1000 

35 nM or less. 

3. ) Sufficient supermotif bearing-peptides, or a sufficient array of allele-specific motif-bearing 
peptides, are selected to give broad population coverage. For example, it is preferable to have at least 80% 
population coverage. A Monte Carlo analysis, a statistical evaluation known in the art, can be employed to assess 
the breadth, or redundancy of, population coverage. 

40 4.) When selecting epitopes from cancer-related antigens it is often useful to select analogs because 

the patient may have developed tolerance to the native epitope. 
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5. ) Of particular relevance are epitopes referred to as "nested epitopes." Nested epitopes occur where 
at least two epitopes overlap in a given peptide sequence. A nested peptide sequence can comprise B cell, HLA 
class I and/or HLA class II epitopes. When providing nested epitopes, a general objective is to provide the greatest 
number of epitopes per sequence. Thus, an aspect is to avoid providing a peptide that is any longer than the amino 

5 terminus of the amino terminal epitope and the carboxyl terminus of the carboxyl terminal epitope in the peptide. 
When providing a multi-epitopic sequence, such as a sequence comprising nested epitopes, it is generally important 
to screen the sequence in order to insure that it does not have pathological or other deleterious biological properties. 

6. ) If a polyepitopic protein is created, or when creating a minigene, an objective is to generate the 
smallest peptide that encompasses the epitopes of interest. This principle is similar, if not the same as that 

10 employed when selecting a peptide comprising nested epitopes. However, with an artificial polyepitopic peptide, 
the size minimization objective is balanced against the need to integrate any spacer sequences between epitopes in 
the polyepitopic protein. Spacer amino acid residues can, for example, be introduced to avoid junctional epitopes 
(an epitope recognized by the immune system, not present in the target antigen, and only created by the man-made 
juxtaposition of epitopes), or to facilitate cleavage between epitopes and thereby enhance epitope presentation. 

15 Junctional epitopes are generally to be avoided because the recipient may generate an immune response to that non- 
native epitope. Of particular concern is a junctional epitope that is a "dominant epitope." A dominant epitope may 
lead to such a zealous response that immune responses to other epitopes are diminished or suppressed. 

7. ) Where the sequences of multiple variants of the same target protein are present, potential peptide 
epitopes can also be selected on the basis of their conservancy. For example, a criterion for conservancy may 

20 define that the entire sequence of an HLA class I binding peptide or the entire 9-mer core of a class II binding 
peptide be conserved in a designated percentage of the sequences evaluated for a specific protein antigen. 
X.C.I. Minigene Vaccines 
A number of different approaches are available which allow simultaneous delivery of multiple epitopes. 
Nucleic acids encoding the peptides of the invention are a particularly useful embodiment of the invention. 
25 Epitopes for inclusion in a minigene are preferably selected according to the guidelines set forth in the previous 
section. A preferred means of administering nucleic acids encoding the peptides of the invention uses minigene 
constructs encoding a peptide comprising one or multiple epitopes of the invention. 

The use of multi-epitope minigenes is described below and in, Ishioka et al, J. Immunol 162:3915-3925, 
1999; An, L. and Whitton, J. L., J. Virol 71:2292, 1997; Thomson, S. A. et al,J. Immunol 157:822, 1996; 
30 Whitton, J. L. et al 9 J. Virol 67:348, 1993; Hanke, R. et al, Vaccine 16:426, 1998. For example, a multi-epitope 

DNA plasmid encoding supermotif- and/or motif-bearing epitopes derived 121P1F1, the PADRE® universal helper 
T cell epitope (or multiple HTL epitopes from 121P1F1), and an endoplasmic reticulum-translocating signal 
sequence can be engineered. A vaccine may also comprise epitopes that are derived from other TAAs. 

The immunogenicity of a multi-epitopic minigene can be confirmed in transgenic mice to evaluate the 
35 magnitude of CTL induction responses against the epitopes tested. Further, the immunogenicity of DNA-encoded 
epitopes in vivo can be correlated with the in vitro responses of specific CTL lines against target cells transfected 
with the DNA plasmid. Thus, these experiments can show that the minigene serves to both: 1.) generate a CTL 
response and 2.) that the induced CTLs recognized cells expressing the encoded epitopes. 

For example, to create a DNA sequence encoding the selected epitopes (minigene) for expression in human 
40 cells, the amino acid sequences of the epitopes may be reverse translated. A human codon usage table can be used 
to guide the codon choice for each amino acid. These epitope-encoding DNA sequences may be directly adjoined, 
so that when translated, a continuous polypeptide sequence is created. To optimize expression and/or 
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immunogenicity, additional elements can be incorporated into the minigene design. Examples of amino acid 
sequences that can be reverse translated and included in the minigene sequence include: HLA class I epitopes, HLA 
class II epitopes, antibody epitopes, a ubiquitination signal sequence, and/or an endoplasmic reticulum targeting 
signal. In addition, HLA presentation of CTL and HTL epitopes may be improved by including synthetic (eg. 
5 poly-alanine) or naturally-occurring flanking sequences adjacent to the CTL or HTL epitopes; these larger peptides 
comprising the epitope(s) are within the scope of the invention. 

The minigene sequence may be converted to DNA by assembling oligonucleotides that encode the plus 
and minus strands of the minigene. Overlapping oligonucleotides (30-100 bases long) may be synthesized, 
phosphorylated, purified and annealed under appropriate conditions using well known techniques. The ends of the 

10 oligonucleotides can be joined, for example, using T4 DNA ligase. This synthetic minigene, encoding the epitope 
polypeptide, can then be cloned into a desired expression vector. 

Standard regulatory sequences well known to those of skill in the art are preferably included in the vector 
to ensure expression in the target cells. Several vector elements are desirable: a promoter with a down-stream 
cloning site for minigene insertion; a polyadenylation signal for efficient transcription termination; an E. coli origin 

15 of replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). Numerous promoters can 
be used for this purpose, eg., the human cytomegalovirus (hCMV) promoter. See, eg., U.S. Patent Nos. 5,580,859 
and 5,589,466 for other suitable promoter sequences. 

Additional vector modifications may be desired to optimize minigene expression and immunogenicity. In 
some cases, introns are required for efficient gene expression, and one or more synthetic or naturally-occurring 

20 introns could be incorporated into the transcribed region of the minigene. The inclusion of mRNA stabilization 
sequences and sequences for replication in mammalian cells may also be considered for increasing minigene 
expression. 

Once an expression vector is selected, the minigene is cloned into the polylinker region downstream of the 
promoter. This plasmid is transformed into an appropriate £. coli strain, and DNA is prepared using standard 
25 techniques. The orientation and DNA sequence of the minigene, as well as all other elements included in the 

vector, are confirmed using restriction mapping and DNA sequence analysis. Bacterial cells harboring the correct 
plasmid can be stored as a master cell bank and a working cell bank. 

In addition, immunostimulatory sequences (ISSs or CpGs) appear to play a role in the immunogenicity of 
DNA vaccines. These sequences may be included in the vector, outside the minigene coding sequence, if desired to 
30 enhance immunogenicity. 

In some embodiments, a bi-cistronic expression vector which allows production of both the minigene- 
encoded epitopes and a second protein (included to enhance or decrease immunogenicity) can be used. Examples 
of proteins or polypeptides that could beneficially enhance the immune response if co-expressed include cytokines 
(eg., IL-2, IL-12, GM-CSF), cytokine-inducing molecules (eg., LeIF), costimulatory molecules, or for HTL 
35 responses, pan-DR binding proteins (PADRE™, Epimmune, San Diego, CA). Helper (HTL) epitopes can be joined 
to intracellular targeting signals and expressed separately from expressed CTL epitopes; this allows direction of the 
HTL epitopes to a cell compartment different than that of the CTL epitopes. If required, this could facilitate more 
efficient entry of HTL epitopes into the HLA class II pathway, thereby improving HTL induction. In contrast to 
HTL or CTL induction, specifically decreasing the immune response by co-expression of immunosuppressive 
40 molecules (eg. TGF-(3) may be beneficial in certain diseases. 

Therapeutic quantities of plasmid DNA can be produced for example, by fermentation in £. coli, followed 
by purification. Aliquots from the working cell bank are used to inoculate growth medium, and grown to saturation 
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in shaker flasks or a bioreactor according to well-known techniques. Plasmid DN A can be purified using standard 
bioseparation technologies such as solid phase anion-exchange resins supplied by QIAGEN, Inc. (Valencia, 
California). If required, supercoiled DNA can be isolated from the open circular and linear forms using gel 
electrophoresis or other methods. 

Purified plasmid DNA can be prepared for injection using a variety of formulations. The simplest of these 
is reconstitution of lyophilized DNA in sterile phosphate-buffer saline (PBS). This approach, known as "naked 
DNA," is currently being used for intramuscular (IM) administration in clinical trials. To maximize the 
immunotherapeutic effects of minigene DNA vaccines, an alternative method for formulating purified plasmid 
DNA may be desirable. A variety of methods have been described, and new techniques may become available. 
Cationic lipids, glycolipids, and fusogenic liposomes can also be used in the formulation (see, e.g., as described by 
WO 93/24640; Mannino & Gould-Fogerite, BioTechniques 6(7): 682 (1988); U.S. Pat No. 5,279,833; WO 
91/06309; and Feigner, et ai, Proc. Nat 'I Acad. Sci. USA 84:7413 (1987). In addition, peptides and compounds 
referred to collectively as protective, interactive, non-condensing compounds (PINC) could also be complexed to 
purified plasmid DNA to influence variables such as stability, intramuscular dispersion, or trafficking to specific 
organs or cell types. 

Target cell sensitization can be used as a functional assay for expression and HLA class I presentation of 
minigene-encoded CTL epitopes. For example, the plasmid DNA is introduced into a mammalian cell line that is 
suitable as a target for standard CTL chromium release assays. The transfection method used will be dependent on 
the final formulation. Electroporation can be used for "naked" DNA, whereas cationic lipids allow direct in vitro 
transfection. A plasmid expressing green fluorescent protein (GFP) can be co-trans fected to allow enrichment of 
transfected cells using fluorescence activated cell sorting (FACS). These cells are then chromium-51 ( 5, Cr) labeled 
and used as target cells for epitope-specific CTL lines; cytolysis, detected by 5l Cr release, indicates both production 
of, and HLA presentation of, minigene-encoded CTL epitopes. Expression of HTL epitopes may be evaluated in an 
analogous manner using assays to assess HTL activity. 

In vivo immunogenicity is a second approach for functional testing of minigene DNA formulations. 
Transgenic mice expressing appropriate human HLA proteins are immunized with the DNA product. The dose and 
route of administration are formulation dependent {e.g., IM for DNA in PBS, intraperitoneal (i.p.) for lipid- 
complexed DNA). Twenty-one days after immunization, splenocytes are harvested and restimulated for one week 
in the presence of peptides encoding each epitope being tested. Thereafter, for CTL effector cells, assays are 
conducted for cytolysis of peptide-loaded, 5, Cr-labeled target cells using standard techniques. Lysis of target cells 
that were sensitized by HLA loaded with peptide epitopes, corresponding to minigene-encoded epitopes, 
demonstrates DNA vaccine function for in vivo induction of CTLs. Immunogenicity of HTL epitopes is confirmed 
in transgenic mice in an analogous manner. 

Alternatively, the nucleic acids can be administered using ballistic delivery as described, for instance, in 
U.S. Patent No. 5,204,253. Using this technique, particles comprised solely of DNA are administered. In a further 
alternative embodiment, DNA can be adhered to particles, such as gold particles. 

Minigenes can also be delivered using other bacterial or viral delivery systems well known in the art, e.g., 
an expression construct encoding epitopes of the invention can be incorporated into a viral vector such as vaccinia. 
X.C.2. Combinations of CTL Peptides with Helper Peptides 

Vaccine compositions comprising CTL peptides of the invention can be modified, e.g., analoged, to 
provide desired attributes, such as improved serum half life, broadened population coverage or enhanced 
immunogenicity. 
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For instance, the ability of a peptide to induce CTL activity can be enhanced by linking the peptide to a 
sequence which contains at least one epitope that is capable of inducing a T helper cell response. Although a CTL 
peptide can be directly linked to a T helper peptide, often CTL epitope/HTL epitope conjugates are linked by a 
spacer molecule. The spacer is typically comprised of relatively small, neutral molecules, such as amino acids or 
amino acid mimetics, which are substantially uncharged under physiological conditions. The spacers are typically 
selected from, e.g., Ala, Gly, or other neutral spacers of nonpolar amino acids or neutral polar amino acids. It will 
be understood that the optionally present spacer need not be comprised of the same residues and thus may be a 
hetero- or homo-oligomer. When present, the spacer will usually be at least one or two residues, more usually three 
to six residues and sometimes 10 or more residues. The CTL peptide epitope can be linked to the T helper peptide 
epitope either directly or via a spacer either at the amino or carboxy terminus of the CTL peptide. The amino 
terminus of either the immunogenic peptide or the T helper peptide may be acylated. 

In certain embodiments, the T helper peptide is one that is recognized by T helper cells present in a 
majority of a genetically diverse population. This can be accomplished by selecting peptides that bind to many, 
most, or all of the HLA class II molecules. Examples of such amino acid bind many HLA Class II molecules 
include sequences from antigens such as tetanus toxoid at positions 830-843 (QYIKANSKFIG1TE; SEQ ID NO: 
26 ) , Plasmodium falciparum circumsporozoite (CS) protein at positions 378-398 

(DIEKK1AKMEKASSVFNVVNS; SEQ ID NO: 27 \ and Streptococcus 18kD protein at positions 116-131 

(GAVDSILGGVATYGAA; SEQ ID NO: 28 V Other examples include peptides bearing a DR 1-4-7 

supermotif, or either of the DR3 motifs. 

Alternatively, it is possible to prepare synthetic peptides capable of stimulating T helper lymphocytes, in a 
loosely HLA-restricted fashion, using amino acid sequences not found in nature {see, e.g., PCT publication WO 
95/07707). These synthetic compounds called Pan-DR-binding epitopes {e.g., PADRE™, Epimmune, Inc., San 
Diego, CA) are designed to most preferably bind most HLA-DR (human HLA class II) molecules. For instance, a 
pan-DR-binding epitope peptide having the formula: aKXVAA WTLKAAa (SEQ ID NO: 29=), where "X" is 
either cyclohexylalanine, phenylalanine, or tyrosine, and a is either D-alanine or L-alanine, has been found to bind to 
most HLA-DR alleles, and to stimulate the response of T helper lymphocytes from most individuals, regardless of 
their HLA type. An alternative of a pan-DR binding epitope comprises all "L" natural amino acids and can be 
provided in the form of nucleic acids that encode the epitope. 

HTL peptide epitopes can also be modified to alter their biological properties. For example, they can be 
modified to include D-amino acids to increase their resistance to proteases and thus extend their serum half life, or 
they can be conjugated to other molecules such as lipids, proteins, carbohydrates, and the like to increase their 
biological activity. For example, a T helper peptide can be conjugated to one or more palmitic acid chains at either 
the amino or carboxyl termini. 

X.C.3. Combinations of CTL Peptides with T Cell Priming Agents 
In some embodiments it may be desirable to include in the pharmaceutical compositions of the invention at 
least one component which primes B lymphocytes or T lymphocytes. Lipids have been identified as agents capable 
of priming CTL in vivo. For example, palmitic acid residues can be attached to the e-and a- amino groups of a 
lysine residue and then linked, e.g., via one or more linking residues such as Gly, Gly-Gly-, Ser, Ser-Ser, or the like, 
to an immunogenic peptide. The lipidated peptide can then be administered either directly in a micelle or particle, 
incorporated into a liposome, or emulsified in an adjuvant, e.g., incomplete Freund's adjuvant. In a preferred 
embodiment, a particularly effective immunogenic composition comprises palmitic acid attached to e- and a- amino 
groups of Lys, which is attached via linkage, e.g., Ser-Ser, to the amino terminus of the immunogenic peptide. 
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As another example of lipid priming of CTL responses, E. coli lipoproteins, such as tripalmitoyl-S- 
glycerylcysteinlyseryl- serine (P 3 CSS) can be used to prime virus specific CTL when covalently attached to an 
appropriate peptide (see, e.g., Deres, et aL, Nature 342:561, 1989). Peptides of the invention can be coupled to 
P 3 CSS, for example, and the lipopeptide administered to an individual to specifically prime an immune response to 
5 the target antigen. Moreover, because the induction of neutralizing antibodies can also be primed with P 3 CSS- 
conjugated epitopes, two such compositions can be combined to more effectively elicit both humoral and cell- 
mediated responses. 

X.C.4. Vaccine Compositions Comprising DC Pulsed with CTL and/or HTL Peptides 

An embodiment of a vaccine composition in accordance with the invention comprises ex vivo 

10 administration of a cocktail of epitope-bearing peptides to PBMC, or isolated DC therefrom, from the patient's 
blood. A pharmaceutical to facilitate harvesting of DC can be used, such as Progenipoietin™ (Pharmacia- 
Monsanto, St. Louis, MO) or GM-CSF/IL-4. After pulsing the DC with peptides and prior to reinfusion into 
patients, the DC are washed to remove unbound peptides. In this embodiment, a vaccine comprises peptide-pulsed 
DCs which present the pulsed peptide epitopes complexed with HLA molecules on their surfaces. 

1 5 The DC can be pulsed ex vivo with a cocktail of peptides, some of which stimulate CTL responses to 

121P1F1. Optionally, a helper T cell (HTL) peptide, such as a natural or artificial loosely restricted HLA Class II 
peptide, can be included to facilitate the CTL response. Thus, a vaccine in accordance with the invention is used to 
treat a cancer which expresses or overexpresses 1 2 1 P 1 F 1 . 
X.D. Adoptive Immunotherapy 

20 Antigenic 121PlFl-related peptides are used to elicit a CTL and/or HTL response ex vivo, as well. The 

resulting CTL or HTL cells, can be used to treat tumors in patients that do not respond to other conventional forms 
of therapy, or will not respond to a therapeutic vaccine peptide or nucleic acid in accordance with the invention. Ex 
vivo CTL or HTL responses to a particular antigen are induced by incubating in tissue culture the patient's, or 
genetically compatible, CTL or HTL precursor cells together with a source of antigen-presenting cells (APC), such 

25 as dendritic cells, and the appropriate immunogenic peptide. After an appropriate incubation time (typically about 
7-28 days), in which the precursor cells are activated and expanded into effector cells, the cells are infused back into 
the patient, where they will destroy (CTL) or facilitate destruction (HTL) of their specific target cell (e.g., a tumor 
cell). Transfected dendritic cells may also be used as antigen presenting cells. 

X.E. Administration of Vaccines for Therapeutic or Prophylactic Purposes 

30 Pharmaceutical and vaccine compositions of the invention are typically used to treat and/or prevent a 

cancer that expresses or overexpresses 1 2 1 P 1 F 1 . In therapeutic applications, peptide and/or nucleic acid 
compositions are administered to a patient in an amount sufficient to elicit an effective B cell, CTL and/or HTL 
response to the antigen and to cure or at least partially arrest or slow symptoms and/or complications. An amount 
adequate to accomplish this is defined as "therapeutically effective dose." Amounts effective for this use will 

35 depend on, e.g., the particular composition administered, the manner of administration, the stage and severity of the 
disease being treated, the weight and general state of health of the patient, and the judgment of the prescribing 
physician. 

For pharmaceutical compositions, the immunogenic peptides of the invention, or DNA encoding them, are 
generally administered to an individual already bearing a tumor that expresses 1 2 1 P 1 F 1 . The peptides or DNA 
40 encoding them can be administered individually or as fusions of one or more peptide sequences. Patients can be 
treated with the immunogenic peptides separately or in conjunction with other treatments, such as surgery, as 
appropriate. 
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For therapeutic use, administration should generally begin at the first diagnosis of 121PlFl-associated 
cancer. This is followed by boosting doses until at least symptoms are substantially abated and for a period 
thereafter. The embodiment of the vaccine composition (i.e., including, but not limited to embodiments such as 
peptide cocktails, polyepitopic polypeptides, minigenes, or TAA-specific CTLs or pulsed dendritic cells) delivered 
5 to the patient may vary according to the stage of the disease or the patient's health status. For example, in a patient 
with a tumor that expresses 121 PI Fl, a vaccine comprising 121 PI Fl -specific CTL may be more efficacious in 
killing tumor cells in patient with advanced disease than alternative embodiments. 

It is generally important to provide an amount of the peptide epitope delivered by a mode of administration 
sufficient to effectively stimulate a cytotoxic T cell response; compositions which stimulate helper T cell responses 
10 can also be given in accordance with this embodiment of the invention. 

The dosage for an initial therapeutic immunization generally occurs in a unit dosage range where the lower 
value is about I, 5, 50, 500, or 1,000 ug and the higher value is about 10,000; 20,000; 30,000; or 50,000 ug. 
Dosage values for a human typically range from about 500 ug to about 50,000 ug per 70 kilogram patient. Boosting 
dosages of between about 1.0 ug to about 50,000 ug of peptide pursuant to a boosting regimen over weeks to 
15 months may be administered depending upon the patient's response and condition as determined by measuring the 
specific activity of CTL and HTL obtained from the patient's blood. Administration should continue until at least 
clinical symptoms or laboratory tests indicate that the neoplasia, has been eliminated or reduced and for a period 
thereafter. The dosages, routes of administration, and dose schedules are adjusted in accordance with 
methodologies known in the art. 
20 In certain embodiments, the peptides and compositions of the present invention are employed in serious 

disease states, that is, life-threatening or potentially life threatening situations. In such cases, as a result of the 
minimal amounts of extraneous substances and the relative nontoxic nature of the peptides in preferred 
compositions of the invention, it is possible and may be felt desirable by the treating physician to administer 
substantial excesses of these peptide compositions relative to these stated dosage amounts. 
25 The vaccine compositions of the invention can also be used purely as prophylactic agents. Generally the 

dosage for an initial prophylactic immunization generally occurs in a unit dosage range where the lower value is 
about 1, 5, 50, 500, or 1000 ug and the higher value is about 10,000; 20,000; 30,000; or 50,000 ug. Dosage values 
for a human typically range from about 500 ug to about 50,000 ug per 70 kilogram patient. This is followed by 
boosting dosages of between about 1.0 ug to about 50,000 ug of peptide administered at defined intervals from 
30 about four weeks to six months after the initial administration of vaccine. The immunogenicity of the vaccine can 
be assessed by measuring the specific activity of CTL and HTL obtained from a sample of the patient's blood. 

The pharmaceutical compositions for therapeutic treatment are intended for parenteral, topical, oral, nasal, 
intrathecal, or local (e.g. as a cream or topical ointment) administration. Preferably, the pharmaceutical 
compositions are administered parentally, e.g., intravenously, subcutaneously, intradermal ly, or intramuscularly. 
35 Thus, the invention provides compositions for parenteral administration which comprise a solution of the 
immunogenic peptides dissolved or suspended in an acceptable carrier, preferably an aqueous carrier. 

A variety of aqueous carriers may be used, e.g., water, buffered water, 0.8% saline, 0.3% glycine, 
hyaluronic acid and the like. These compositions may be sterilized by conventional, well-known sterilization 
techniques, or may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or lyophilized, 
40 the lyophilized preparation being combined with a sterile solution prior to administration. 

The compositions may contain pharmaceutical ly acceptable auxiliary substances as required to 
approximate physiological conditions, such as pH-adjusting and buffering agents, tonicity adjusting agents, wetting 
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agents, preservatives, and the like, for example, sodium acetate, sodium lactate, sodium chloride, potassium 
chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of peptides of the invention in the pharmaceutical formulations can vary widely, i.e., 
from less than about 0. 1%, usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be 
5 selected primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of administration 
selected. 

A human unit dose form of a composition is typically included in a pharmaceutical composition that 
comprises a human unit dose of an acceptable carrier, in one embodiment an aqueous carrier, and is administered in 
a volume/quantity that is known by those of skill in the art to be used for administration of such compositions to 

10 humans {see, e.g., Remington's Pharmaceutical Sciences , 17 th Edition, A. Gennaro, Editor, Mack Publishing Co., 
Easton, Pennsylvania, 1985). For example a peptide dose for initial immunization can be from about 1 to about 
50,000 ng, generally 100-5,000 ug, for a 70 kg patient. For example, for nucleic acids an initial immunization may 
be performed using an expression vector in the form of naked nucleic acid administered IM (or SC or ID) in the 
amounts of 0.5-5 mg at multiple sites. The nucleic acid (0.1 to 1000 ug) can also be administered using a gene gun. 

15 Following an incubation period of 3-4 weeks, a booster dose is then administered. The booster can be recombinant 
fowlpox virus administered at a dose of 5-10 7 to 5xl0 9 pfu. 

For antibodies, a treatment generally involves repeated administration of the anti-121PlFl antibody 
preparation, via an acceptable route of administration such as intravenous injection (IV), typically at a dose in the 
range of about 0.1 to about 10 mg/kg body weight. In general, doses in the range of 10-500 mg mAb per week are 

20 effective and well tolerated. Moreover, an initial loading dose of approximately 4 mg/kg patient body weight IV, 
followed by weekly doses of about 2 mg/kg IV of the anti- 121P1F1 mAb preparation represents an acceptable 
dosing regimen. As appreciated by those of skill in the art, various factors can influence the ideal dose in a 
particular case. Such factors include, for example, half life of a composition, the binding affinity of an Ab, the 
immunogenicity of a substance, the degree of 121 PI Fl expression in the patient, the extent of circulating shed 

25 121P1F1 antigen, the desired steady-state concentration level, frequency of treatment, and the influence of 

chemotherapeutic or other agents used in combination with the treatment method of the invention, as well as the 
health status of a particular patient. Non-limiting preferred human unit doses are, for example, 500ug - lmg, lmg - 
50mg, 50mg - lOOmg, lOOmg - 200mg, 200mg - 300mg, 400mg - 500mg, 500mg - 600mg, 600mg - 700mg, 700mg 
- 800mg, 800mg - 900mg, 900mg - lg, or lmg - 700mg. In certain embodiments, the dose is in a range of 2-5 

30 mg/kg body weight, e.g., with follow on weekly doses of 1-3 mg/kg; 0.5mg, 1, 2, 3, 4, 5, 6, 7, 8, 9, lOmg/kg body 
weight followed, e.g., in two, three or four weeks by weekly doses; 0.5 - lOmg/kg body weight, e.g., followed in 
two, three or four weeks by weekly doses; 225, 250, 275, 300, 325, 350, 375, 400mg m 2 of body area weekly; 1- 
600mg m 2 of body area weekly; 225-400mg m 2 of body area weekly; these does can be followed by weekly doses 
for 2, 3,4, 5,6, 7, 8,9, 19, 11, 1 2 or more weeks. 

35 In one embodiment, human unit dose forms of polynucleotides comprise a suitable dosage range or 

effective amount that provides any therapeutic effect. As appreciated by one of ordinary skill in the art a 
therapeutic effect depends on a number of factors, including the sequence of the polynucleotide, molecular weight 
of the polynucleotide and route of administration. Dosages are generally selected by the physician or other health 
care professional in accordance with a variety of parameters known in the art, such as severity of symptoms, history 

40 of the patient and the like. Generally, for a polynucleotide of about 20 bases, a dosage range may be selected from, 
for example, an independently selected lower limit such as about 0.1, 0.25, 0.5, 1, 2, 5, 10, 20, 30, 40, 50, 60, 70, 
80, 90, 100, 200, 300, 400 or 500 mg/kg up to an independently selected upper limit, greater than the lower limit, of 
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about 60, 80, 100, 200, 300, 400, 500, 750, 1000, 1500, 2000, 3000, 4000, 5000, 6000, 7000, 8000, 9000 or 10,000 
mg/kg. For example, a dose may be about any of the following: 0.1 to 100 mg/kg, 0.1 to 50 mg/kg, 0.1 to 25 
mg/kg, 0.1 to 10 mg/kg, 1 to 500 mg/kg, 100 to 400 mg/kg, 200 to 300 mg/kg, 1 to 100 mg/kg, 100 to 200 mg/kg, 
300 to 400 mg/kg, 400 to 500 mg/kg, 500 to 1000 mg/kg, 500 to 5000 mg/kg, or 500 to 10,000 mg/kg. Generally, 

5 parenteral routes of administration may require higher doses of polynucleotide compared to more direct application 
to the nucleotide to diseased tissue, as do polynucleotides of increasing length. 

In one embodiment, human unit dose forms of T-cells comprise a suitable dosage range or effective 
amount that provides any therapeutic effect. As appreciated by one of ordinary skill in the art, a therapeutic effect 
depends on a number of factors. Dosages are generally selected by the physician or other health care professional 

10 in accordance with a variety of parameters known in the art, such as severity of symptoms, history of the patient and 
the like. A dose may be about 10 4 cells to about 10 6 cells, about 10 6 cells to about 10 8 cells, about 10 8 to about 10" 
cells, or about 10 8 to about 5 x 10 10 cells. A dose may also about 10 6 cells/m 2 to about 10 10 cells/m 2 , or about 10 6 
cells/m 2 to about 10 8 cells/m 2 . 

Proteins(s) of the invention, and/or nucleic acids encoding the protein(s), can also be administered via 

1 5 liposomes, which may also serve to: 1 ) target the proteins(s) to a particular tissue, such as lymphoid tissue; 2) to 
target selectively to diseases cells; or, 3) to increase the half-life of the peptide composition. Liposomes include 
emulsions, foams, micelles, insoluble monolayers, liquid crystals, phospholipid dispersions, lamellar layers and the 
like. In these preparations, the peptide to be delivered is incorporated as part of a liposome, alone or in conjunction 
with a molecule which binds to a receptor prevalent among lymphoid cells, such as monoclonal antibodies which 

20 bind to the CD45 antigen, or with other therapeutic or immunogenic compositions. Thus, liposomes either filled or 
decorated with a desired peptide of the invention can be directed to the site of lymphoid cells, where the liposomes 
then deliver the peptide compositions. Liposomes for use in accordance with the invention are formed from 
standard vesicle-forming lipids, which generally include neutral and negatively charged phospholipids and a sterol, 
such as cholesterol. The selection of lipids is generally guided by consideration of, e.g., liposome size, acid lability 

25 and stability of the liposomes in the blood stream. A variety of methods are available for preparing liposomes, as 
described in, e.g., Szoka, et ai. t Ann. Rev. Biophys. Bioeng. 9:467 (1980), and U.S. Patent Nos. 4,235,871, 
4,501,728, 4,837,028, and 5,019,369. 

For targeting cells of the immune system, a ligand to be incorporated into the liposome can include, e.g., 
antibodies or fragments thereof specific for cell surface determinants of the desired immune system cells. A 

30 liposome suspension containing a peptide may be administered intravenously, locally, topically, etc. in a dose which 
varies according to, inter alia, the manner of administration, the peptide being delivered, and the stage of the 
disease being treated. 

For solid compositions, conventional nontoxic solid carriers may be used which include, for example, 
pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharin, talcum, cellulose, 

35 glucose, sucrose, magnesium carbonate, and the like. For oral administration, a pharmaceutically acceptable 

nontoxic composition is formed by incorporating any of the normally employed excipients, such as those carriers 
previously listed, and generally 10-95% of active ingredient, that is, one or more peptides of the invention, and 
more preferably at a concentration of 25%-75%. 

For aerosol administration, immunogenic peptides are preferably supplied in finely divided form along 

40 with a surfactant and propellant. Typical percentages of peptides are about 0.0 1 %-20% by weight, preferably about 
1%-10%. The surfactant must, of course, be nontoxic, and preferably soluble in the propellant. Representative of 
such agents are the esters or partial esters of fatty acids containing from about 6 to 22 carbon atoms, such as 
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caproic, octanoic, lauric, palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an aliphatic polyhydric 
alcohol or its cyclic anhydride. Mixed esters, such as mixed or natural glycerides may be employed. The surfactant 
may constitute about 0.1%-20% by weight of the composition, preferably about 0.25-5%. The balance of the 
composition is ordinarily propellant. A carrier can also be included, as desired, as with, e.g., lecithin for intranasal 
delivery. 

XL) Diagnostic and Prognostic Embodiments of 121P1F1 . 

As disclosed herein, 121P1F1 polynucleotides, polypeptides, reactive cytotoxic T cells (CTL), reactive 
helper T cells (HTL) and anti-polypeptide antibodies are used in well known diagnostic, prognostic and therapeutic 
assays that examine conditions associated with dysregulated cell growth such as cancer, in particular the cancers 
listed in Table I (see, e.g., both its specific pattern of tissue expression as well as its overexpression in certain 
cancers as described for example in Example 4). 

12 1 PI Fl can be analogized to a prostate associated antigen PSA, the archetypal marker that has been used 
by medical practitioners for years to identify and monitor the presence of prostate cancer (see, e.g., Merrill et al, J. 
Urol. 163(2): 503-5120 (2000); Polascik et al, J. Urol. Aug; 162(2):293-306 (1999) and Fortier et al, J. Nat. 
Cancer Inst. 91(19): 1635-1640(1999)). A variety of other diagnostic markers are also used in similar contexts 
including p53 and K-ras (see, e.g., Tulchinsky et al, Int J Mol Med 1999 Jul 4(1):99-102 and Minimoto et al, 
Cancer Detect Prev 2000;24(1):1-12). Therefore, this disclosure of 121 PI Fl polynucleotides and polypeptides (as 
well as 121 PI Fl polynucleotide probes and anti-121PlFl antibodies used to identify the presence of these 
molecules) and their properties allows skilled artisans to utilize these molecules in methods that are analogous to 
those used, for example, in a variety of diagnostic assays directed to examining conditions associated with cancer. 

Typical embodiments of diagnostic methods which utilize the 121 PI Fl polynucleotides, polypeptides, 
reactive T cells and antibodies are analogous to those methods from well-established diagnostic assays which 
employ, e.g., PSA polynucleotides, polypeptides, reactive T cells and antibodies. For example, just as PSA 
polynucleotides are used as probes (for example in Northern analysis, see, e.g., Sharief et al, Biochem. Mol. Biol. 
Int. 33(3):567-74(1994)) and primers (for example in PCR analysis, see, e.g., Okegawa et al, J. Urol. 163(4): 1 189- 
1 190 (2000)) to observe the presence and/or the level of PSA mRNAs in methods of monitoring PSA 
overexpression or the metastasis of prostate cancers, the 121 PI Fl polynucleotides described herein can be utilized 
in the same way to detect 121P1F1 overexpression or the metastasis of prostate and other cancers expressing this 
gene. Alternatively, just as PSA polypeptides are used to generate antibodies specific for PSA which can then be 
used to observe the presence and/or the level of PSA proteins in methods to monitor PSA protein overexpression 
(see, e.g., Stephan et al, Urology 55(4):560-3 (2000)) or the metastasis of prostate cells (see, e.g., Alanen et al, 
Pathol. Res. Pract. 192(3):233-7 (1996)), the 121P1F1 polypeptides described herein can be utilized to generate 
antibodies for use in detecting 121 PI Fl overexpression or the metastasis of prostate cells and cells of other cancers 
expressing this gene. 

Specifically, because metastases involves the movement of cancer cells from an organ of origin (such as 
the lung or prostate gland etc.) to a different area of the body (such as a lymph node), assays which examine a 
biological sample for the presence of cells expressing 121P1F1 polynucleotides and/or polypeptides can be used to 
provide evidence of metastasis. For example, when a biological sample from tissue that does not normally contain 
1 2 1 P 1 F 1 -expressing cells (lymph node) is found to contain 1 2 1 P 1 F 1 -expressing cells such as the 1 2 1 P 1 F 1 
expression seen in LAPC4 and LAPC9, xenografts isolated from lymph node and bone metastasis, respectively, this 
finding is indicative of metastasis. 
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Alternatively 121P1F1 polynucleotides and/or polypeptides can be used to provide evidence of cancer, for 
example, when cells in a biological sample that do not normally express 1 2 1 P 1 F 1 or express 1 2 1 P 1 F 1 at a different 
level are found to express 121P1F1 or have an increased expression of 121P1F1 (see, e.g., the 121P1F1 expression 
in the cancers listed in Table I and in patient samples etc. shown in the accompanying Figures). In such assays, 
artisans may further wish to generate supplementary evidence of metastasis by testing the biological sample for the 
presence of a second tissue restricted marker (in addition to 121 PI Fl) such as PSA, PSCA etc. (see, e.g., Alanen et 
a!., Pathol. Res. Pract. 192(3): 233-237 (1996)). 

Just as PSA polynucleotide fragments and polynucleotide variants are employed by skilled artisans for use 
in methods of monitoring PSA, 121P1F1 polynucleotide fragments and polynucleotide variants are used in an 
analogous manner. In particular, typical PSA polynucleotides used in methods of monitoring PSA are probes or 
primers which consist of fragments of the PSA cDNA sequence. Illustrating this, primers used to PCR amplify a 
PSA polynucleotide must include less than the whole PSA sequence to function in the polymerase chain reaction. 
In the context of such PCR reactions, skilled artisans generally create a variety of different polynucleotide 
fragments that can be used as primers in order to amplify different portions of a polynucleotide of interest or to 
optimize amplification reactions (see, e.g., Caetano-Anolles, G. Biotechniques 25(3): 472-476, 478-480 (1998); 
Robertson et al, Methods Mol. Biol. 98:121-154 (1998)). An additional illustration of the use of such fragments is 
provided in Example 4, where a 121 PI Fl polynucleotide fragment is used as a'probe to show the expression of 
121P1F1 RNAs in cancer cells. In addition, variant polynucleotide sequences are typically used as primers and 
probes for the corresponding mRNAs in PCR and Northern analyses (see, e.g., Sawai et al, Fetal Diagn. Ther. 1996 
Nov-Dec 1 1(6):407-13 and Current Protocols In Molecular Biology, Volume 2, Unit 2, Frederick M. Ausubel et al 
eds., 1995)). Polynucleotide fragments and variants are useful in this context where they are capable of binding to a 
target polynucleotide sequence (e.g., a 121 PI Fl polynucleotide shown in Figure 2 or variant thereof) under 
conditions of high stringency. 

Furthermore, PSA polypeptides which contain an epitope that can be recognized by an antibody or T cell 
that specifically binds to that epitope are used in methods of monitoring PSA. 1 2 1 P 1 F 1 polypeptide fragments and 
polypeptide analogs or variants can also be used in an analogous manner. This practice of using polypeptide 
fragments or polypeptide variants to generate antibodies (such as anti-PSA antibodies or T cells) is typical in the art 
with a wide variety of systems such as fusion proteins being used by practitioners (see, e.g., Current Protocols In 
Molecular Biology, Volume 2, Unit 16, Frederick M. Ausubel et al eds., 1995). In this context, each epitope(s) 
functions to provide the architecture with which an antibody or T cell is reactive. Typically, skilled artisans create a 
variety of different polypeptide fragments that can be used in order to generate immune responses specific for 
different portions of a polypeptide of interest (see, e.g., U.S. Patent No. 5,840,501 and U.S. Patent No. 5,939,533). 
For example it may be preferable to utilize a polypeptide comprising one of the 121 PI Fl biological motifs 
discussed herein or a motif-bearing subsequence which is readily identified by one of skill in the art based on motifs 
available in the art. Polypeptide fragments, variants or analogs are typically useful in this context as long as they 
comprise an epitope capable of generating an antibody or T cell specific for a target polypeptide sequence (e.g. a 
121P1F1 polypeptide shown in Figure 3). 

As shown herein, the 121 PI Fl polynucleotides and polypeptides (as well as the 121 PI Fl polynucleotide 
probes and anti- 1 2 1 P 1 F 1 antibodies or T cells used to identify the presence of these molecules) exhibit specific 
properties that make them useful in diagnosing cancers such as those listed in Table I. Diagnostic assays that 
measure the presence of 121 PI Fl gene products, in order to evaluate the presence or onset of a disease condition 
described herein, such as prostate cancer, are used to identify patients for preventive measures or further 
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monitoring, as has been done so successfully with PSA. Moreover, these materials satisfy a need in the art for 
molecules having similar or complementary characteristics to PSA in situations where, for example, a definite 
diagnosis of metastasis of prostatic origin cannot be made on the basis of a test for PSA alone (see, e.g., Alanen et 
ai, Pathol. Res. Pract. 192(3): 233-237 (1996)), and consequently, materials such as 121P1F1 polynucleotides and 

5 polypeptides (as well as the 121P1F1 polynucleotide probes and anti-121PlFl antibodies used to identify the 
presence of these molecules) need to be employed to confirm a metastases of prostatic origin. 

Finally, in addition to their use in diagnostic assays, the 121 PI Fl polynucleotides disclosed herein have a 
number of other utilities such as their use in the identification of oncogenetic associated chromosomal abnormalities 
in the chromosomal region to which the 121P1F1 gene maps (see Example 3 below). Moreover, in addition to their 

10 use in diagnostic assays, the 121 PI Fl -related proteins and polynucleotides disclosed herein have other utilities such 
as their use in the forensic analysis of tissues of unknown origin (see, e.g., Takahama K Forensic Sci Int 1996 Jun 
28;80(l-2): 63-9). 

Additionally, 121 P1F1 -related proteins or polynucleotides of the invention can be used to treat a 
pathologic condition characterized by the over-expression of 121P1F1. For example, the amino acid or nucleic acid 
15 sequence of Figure 2 or Figure 3, or fragments of either, can be used to generate an immune response to a 121P1F1 
antigen. Antibodies or other molecules that react with 12 1 PI Fl can be used to modulate the function of this 
molecule, and thereby provide a therapeutic benefit. 

XIU Inhibition of 121P1F1 Protein Function 

20 The invention includes various methods and compositions for inhibiting the binding of 121P1F1 to its 

binding partner or its association with other protein(s) as well as methods for inhibiting 121P1F1 function. 
XILA/1 Inhibition of 121P1F1 With Intracellular Antibodies 

In one approach, a recombinant vector that encodes single chain antibodies that specifically bind to 
121P1F1 are introduced into 12 1P1F1 expressing cells via gene transfer technologies. Accordingly, the encoded 

25 single chain anti-121PlFl antibody is expressed intracellularly, binds to 121 PI Fl protein, and thereby inhibits its 
function. Methods for engineering such intracellular single chain antibodies are well known. Such intracellular 
antibodies, also known as "intrabodies", are specifically targeted to a particular compartment within the cell, 
providing control over where the inhibitory activity of the treatment is focused. This technology has been 
successfully applied in the art (for review, see Richardson and Marasco, 1995, TIBTECH vol. 13). Intrabodies have 

30 been shown to virtually eliminate the expression of otherwise abundant cell surface receptors (see, e.g., Richardson 
et ai, 1995, Proc. Natl. Acad. Sci. USA 92: 3137-3141; Beerli et aL, 1994, J. Biol. Chem. 289: 23931-23936; 
Deshane et aL, 1994, Gene Ther. 1: 332-337). 

Single chain antibodies comprise the variable domains of the heavy and light chain joined by a flexible 
linker polypeptide, and are expressed as a single polypeptide. Optionally, single chain antibodies are expressed as a 

35 single chain variable region fragment joined to the light chain constant region. Well-known intracellular trafficking 
signals are engineered into recombinant polynucleotide vectors encoding such single chain antibodies in order to 
precisely target the intrabody to the desired intracellular compartment. For example, intrabodies targeted to the 
endoplasmic reticulum (ER) are engineered to incorporate a leader peptide and, optionally, a C-terminal ER 
retention signal, such as the KDEL amino acid motif. Intrabodies intended to exert activity in the nucleus are 

40 engineered to include a nuclear localization signal. Lipid moieties are joined to intrabodies in order to tether the 
intrabody to the cytosolic side of the plasma membrane. Intrabodies can also be targeted to exert function in the 
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cytosol. For example, cytosolic intrabodies are used to sequester factors within the cytosoi, thereby preventing 
them from being transported to their natural cellular destination. 

In one embodiment, intrabodies are used to capture 121P1F1 in the nucleus, thereby preventing its activity 
within the nucleus. Nuclear targeting signals are engineered into such 121P1F1 intrabodies in order to achieve the 
desired targeting. Such 121P1F1 intrabodies are designed to bind specifically to a particular 121 PI Fl domain. In 
another embodiment, cytosolic intrabodies that specifically bind to a 121P1F1 protein are used to prevent 12 1 PI Fl 
from gaining access to the nucleus, thereby preventing it from exerting any biological activity within the nucleus 
(e.g., preventing 121P1F1 from forming transcription complexes with other factors). 

In order to specifically direct the expression of such intrabodies to particular cells, the transcription of the 
intrabody is placed under the regulatory control of an appropriate tumor-specific promoter and/or enhancer. In 
order to target intrabody expression specifically to prostate, for example, the PSA promoter and/or 
promoter/enhancer can be utilized (See, for example, U.S. Patent No. 5,919,652 issued 6 July 1999). 

XII.B/) Inhibition of 121P1F1 with Recombinant Proteins 

In another approach, recombinant molecules bind to 121 PI Fl and thereby inhibit 121P1F1 function. For 
example, these recombinant molecules prevent or inhibit 121P1F1 from accessing/binding to its binding partner(s) 
or associating with other protein(s). Such recombinant molecules can, for example, contain the reactive part(s) of a 
121P1F1 specific antibody molecule. In a particular embodiment, the 121P1F1 binding domain of a 121P1F1 binding 
partner is engineered into a dimeric fusion protein, whereby the fusion protein comprises two 121 PI Fl ligand binding 
domains linked to the Fc portion of a human IgG, such as human lgG 1 . Such IgG portion can contain, for example, the 
C H 2 and C H 3 domains and the hinge region, but not the C H 1 domain. Such dimeric fusion proteins are administered in 
soluble form to patients suffering from a cancer associated with the expression of 121 PI Fl, whereby the dimeric fusion 
protein specifically binds to 1 2 1 P 1 F 1 and blocks 1 2 1 P 1 F 1 interaction with a binding partner. Such dimeric fusion 
proteins are further combined into multimeric proteins using known antibody linking technologies. 

XILC.) Inhibition of 121P1F1 Transcription or Translation 

The present invention also comprises various methods and compositions for inhibiting the transcription of 
the 121 PI Fl gene. Similarly, the invention also provides methods and compositions for inhibiting the translation of 
121PIF1 mRNA into protein. 

In one approach, a method of inhibiting the transcription of the 1 2 1 P 1 F 1 gene comprises contacting the 
1 2 1 P 1 F 1 gene with a 1 2 1 P 1 F 1 antisense polynucleotide. In another approach, a method of inhibiting 1 2 1 P 1 F 1 
mRNA translation comprises contacting a 121P1F1 mRNA with an antisense polynucleotide. In another approach, 
a 121 PI Fl specific ribozyme is used to cleave a 121 PI Fl message, thereby inhibiting translation. Such antisense 
and ribozyme based methods can also be directed to the regulatory regions of the 121 PI Fl gene, such as 121 PI Fl 
promoter and/or enhancer elements. Similarly, proteins capable of inhibiting a 121P1F1 gene transcription factor 
are used to inhibit 121P1F1 mRNA transcription. The various polynucleotides and compositions useful in the 
aforementioned methods have been described above. The use of antisense and ribozyme molecules to inhibit 
transcription and translation is well known in the art. 

Other factors that inhibit the transcription of 121 PI Fl by interfering with 121P1F1 transcriptional 
activation are also useful to treat cancers expressing 121P1F1. Similarly, factors that interfere with 121P1F1 
processing are useful to treat cancers that express 1 2 1 P 1 F 1 . Cancer treatment methods utilizing such factors are 
also within the scope of the invention. 

XIl.D/l General Considerations for Therapeutic Strategies 
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Gene transfer and gene therapy technologies can be used to deliver therapeutic polynucleotide molecules to 
tumor cells synthesizing 121P1F1 (i.e., antisense, ribozyme, polynucleotides encoding intrabodies and other 121P1F1 
inhibitory molecules). A number of gene therapy approaches are known in the art. Recombinant vectors encoding 
121P1F1 antisense polynucleotides, ribozymes, factors capable of interfering with 121P1F1 transcription, and so forth, 
5 can be delivered to target tumor cells using such gene therapy approaches. 

The above therapeutic approaches can be combined with any one of a wide variety of surgical, chemotherapy 
or radiation therapy regimens. The therapeutic approaches of the invention can enable the use of reduced dosages of 
chemotherapy (or other therapies) and/or less frequent administration, an advantage for all patients and particularly for 
those that do not tolerate the toxicity of the chemotherapeutic agent well. 
1 o The anti-tumor activity of a particular composition (e.g., antisense, ribozyme, intrabody), or a combination of 

such compositions, can be evaluated using various in vitro and in vivo assay systems. In vitro assays that evaluate 
therapeutic activity include cell growth assays, soft agar assays and other assays indicative of tumor promoting activity, 
binding assays capable of determining the extent to which a therapeutic composition will inhibit the binding of 121 PI Fl 
to a binding partner, etc. 

1 5 In vivo, the effect of a 1 2 1 P 1 F 1 therapeutic composition can be evaluated in a suitable animal model. For 

example, xenogenic prostate cancer models can be used, wherein human prostate cancer explants or passaged xenograft 
tissues are introduced into immune compromised animals, such as nude or SCID mice (Klein et al., 1997, Nature 
Medicine 3: 402-408). For example, PCT Patent Application W098/16628 and U.S. Patent 6,107,540 describe 
various xenograft models of human prostate cancer capable of recapitulating the development of primary tumors, 

20 micrometastasis, and the formation of osteoblastic metastases characteristic of late stage disease. Efficacy can be 
predicted using assays that measure inhibition of tumor formation, tumor regression or metastasis, and the like. 

In vivo assays that evaluate the promotion of apoptosis are useful in evaluating therapeutic compositions. 
In one embodiment, xenografts from tumor bearing mice treated with the therapeutic composition can be examined 
for the presence of apoptotic foci and compared to untreated control xenograft-bearing mice. The extent to which 

25 apoptotic foci are found in the tumors of the treated mice provides an indication of the therapeutic efficacy of the 
composition. 

The therapeutic compositions used in the practice of the foregoing methods can be formulated into 
pharmaceutical compositions comprising a carrier suitable for the desired delivery method. Suitable carriers include 
any material that when combined with the therapeutic composition retains the anti-tumor function of the therapeutic 
30 composition and is generally non-reactive with the patient's immune system. Examples include, but are not limited 
to, any of a number of standard pharmaceutical carriers such as sterile phosphate buffered saline solutions, 
bacteriostatic water, and the like (see, generally, Remington's Pharmaceutical Sciences 16 th Edition, A. OsaL, Ed., 
1980). 

Therapeutic formulations can be solubilized and administered via any route capable of delivering the 
35 therapeutic composition to the tumor site. Potentially effective routes of administration include, but are not limited 
to, intravenous, parenteral, intraperitoneal, intramuscular, intratumor, intradermal, intraorgan, orthotopic, and the 
like. A preferred formulation for intravenous injection comprises the therapeutic composition in a solution of 
preserved bacteriostatic water, sterile unpreserved water, and/or diluted in polyvinylchloride or polyethylene bags 
containing 0.9% sterile Sodium Chloride for Injection, USP. Therapeutic protein preparations can be lyophilized 
40 and stored as sterile powders, preferably under vacuum, and then reconstituted in bacteriostatic water (containing 
for example, benzyl alcohol preservative) or in sterile water prior to injection. 
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Dosages and administration protocols for the treatment of cancers using the foregoing methods will vary with 
the method and the target cancer, and will generally depend on a number of other factors appreciated in the art. 

XIII.) Kits 

For use in the diagnostic and therapeutic applications described herein, kits are also within the scope of the 
invention. Such kits can comprise a carrier, package or container that is compartmentalized to receive one or more 
containers such as vials, tubes, and the like, each of the container(s) comprising one of the separate elements to be 
used in the method. For example, the container(s) can comprise a probe that is or can be detectably labeled. Such 
probe can be an antibody or polynucleotide specific for a 121PlFl-related protein or a 121 PI Fl gene or message, 
respectively. Where the method utilizes nucleic acid hybridization to detect the target nucleic acid, the kit can also 
have containers containing nucleotide(s) for amplification of the target nucleic acid sequence and/or a container 
comprising a reporter-means, such as a biotin-binding protein, such as avidin or streptavidin, bound to a reporter 
molecule, such as an enzymatic, florescent, or radioisotope label. The kit can include all or part of the amino acid 
sequence of Figure 2 or Figure 3 or analogs thereof, or a nucleic acid molecules that encodes such amino acid 
sequences. 

The kit of the invention will typically comprise the container described above and one or more other containers 
comprising materials desirable from a commercial and user standpoint, including buffers, diluents, filters, needles, 
syringes, and package inserts with instructions for use. 

A label can be present on the container to indicate that the composition is used for a specific therapy or non- 
therapeutic application, and can also indicate directions for either in vivo or in vitro use, such as those described above. 
Directions and or other information can also be included on an insert which is included with the kit. 

EXAMPLES: 

Various aspects of the invention are further described and illustrated by way of the several examples that 
follow, none of which are intended to limit the scope of the invention. 

Example I; SSH-Generated Isolation of a cDNA Fragment of the 121P1FI Gene 

Suppression Subtractive Hybridization (SSH) was used to identify cDNAs corresponding to genes that are 
differentially expressed in prostate cancer. The SSH reaction utilized cDNA from two LAPC-9 AD xenografts. 
Specifically, to isolate genes that are involved in the progression of androgen dependent (AD) prostate cancer to 
androgen independent (AI) cancer, the LAPC-9 AD xenograft in male SCID mice was used. Mice that harbored 
LAPC-9 AD xenografts were castrated when the tumors reached a size of 1 cm in diameter. The tumors regressed 
in size and temporarily stopped producing the androgen dependent protein PSA. Seven to fourteen days post- 
castration, PSA levels were detectable again in the blood of the mice. Eventually the tumors develop an Al 
phenotype and start growing again in the castrated males. Tumors were harvested at different time points after 
castration to identify genes that were turned on or off during the transition to androgen independence. 

The gene 121 PI Fl was derived from an LAPC-9 AD minus LAPC-9 AD (28 days post-castration) 
subtraction. The SSH DNA sequence of 254 bp (Figure I) is novel and did not exhibit significant homology to any 
known human genes in public databases. 

The 121P1F1 SSH cDNAof254 bp is listed in Figure 1. The full length 121P1F1 cDNAs and ORFs are 
described in Figure 2 with the protein sequences listed in Figure 3. 
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Materials and Methods 

LAPC Xenografts and Human Tissues : 

LAPC xenografts were obtained from Dr. Charles Sawyers (UCLA) and generated as described (Klein et 
al, 1997, Nature Med. 3: 402-408; Craft et al., 1999, Cancer Res. 59: 5030-5036). Androgen dependent and 
independent LAPC-4 xenografts LAPC-4 AD and Al, respectively) and LAPC-9 AD and Al xenografts were grown 
in male SC1D mice and were passaged as small tissue chunks in recipient males. LAPC-4 and -9 Al xenografts 
were derived from LAPC-4 or -9 AD tumors, respectively. To generate the Al xenografts, male mice bearing AD 
tumors were castrated and maintained for 2-3 months. After the tumors re-grew, the tumors were harvested and 
passaged in castrated males or in female SCID mice. 

RNA Isolation : 

Tumor tissues were homogenized in Trizol reagent (Life Technologies, Gibco BRL) using 10 ml/ g tissue 
or 10 ml/ 10 8 cells to isolate total RNA. Poly A RNA was purified from total RNA using Qiagen's Oligotex mRNA 
Mini and Midi kits. Total and mRNA were quantified by spectrophotometric analysis (O.D. 260/280 nm) and 
analyzed by gel electrophoresis. 

Oligonucleotides : 

The following HPLC purified oligonucleotides were used. 

DPNCDN (cDNA synthesis primer) : 
5'TTTTGATCAAGCTT 3 o3' (SEQ ID NO: 30W) 

Adaptor 1 : 

5 , CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAG3' (SEQ ID NO: 3_m^) 
3'GGCCCGTCCTAG5' (SEQ ID NO: 32744) 

Adaptor 2 : 

5'GTAATACGACTCACTATAGGGCAGCGTGGTCGCGGCCGAG3' (SEQ IDNO:J3W) 
3'CGGCTCCTAG5' (SEQ ID NO: _3434-8) 

PCR primer 1 : 

5 , CTAATACGACTCACTATAGGGC3' (SEQ ID NO: 35=m) 
Nested primer (NP)1 : 

5'TCGAGCGGCCGCCCGGGCAGGA3' (SEQ ID NO: 36=&$) 
Nested primer fNP)2 : 

5'AGCGTGGTCGCGGCCGAGGA3' (SEQ ID NO: 37724-) 
Suppression Subtractive Hybridization : 

Suppression Subtractive Hybridization (SSH) was used to identify cDN As corresponding to genes that 
may be differentially expressed in prostate cancer. The SSH reaction utilized cDNA from prostate cancer xenograft 
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LAPC-9AD. The gene 121P1F1 was derived from an LAPC-9 AD minus LAPC-9 AD (28 days post-castration) 
subtraction. The SSH DNA sequence (Figure 1) was identified. 

The cDNA derived from prostate cancer xenograft LAPC-9 AD tissue was used as the source of the 
"driver" cDNA, while the cDNA from prostate cancer xenograft LAPC-9 AD (28 days post-castration) was used as 
the source of the "tester" cDNA. Double stranded cDNAs corresponding to tester and driver cDNAs were 
synthesized from 2 ug of poly(A) + RNA isolated from the relevant tissue, as described above, using CLONTECH's 
PCR-Select cDNA Subtraction Kit and 1 ng of oligonucleotide DPNCDN as primer. First- and second-strand 
synthesis were carried out as described in the Kit's user manual protocol (CLONTECH Protocol No. PT1 1 17-1, 
Catalog No. K1804-1). The resulting cDN A was digested with Dpn II for 3 hrs at 37°C. Digested c DNA was 
extracted with phenol/chloroform (1:1) and ethanol precipitated. 

Tester cDNA was generated by diluting 1 ul of Dpn II digested cDNA from the relevant tissue source (see 
above) (400 ng) in 5 ul of water. The diluted cDNA (2 ul, 160 ng) was then ligated to 2 u.1 of Adaptor 1 and 
Adaptor 2(10 uM), in separate ligation reactions, in a total volume of 10 ul at 16°C overnight, using 400 U of T4 
DNA ligase (CLONTECH). Ligation was terminated with 1 ul of 0.2 M EDTA and heating at 72°C for 5 min. 

The first hybridization was performed by adding 1.5 ul (600 ng) of driver cDNA to each of two tubes 
containing 1.5 ul (20 ng) Adaptor 1- and Adaptor 2- ligated tester cDNA. In a final volume of 4 uj, the samples 
were overlaid with mineral oil, denatured in an MJ Research thermal cycler at 98°C for 1.5 minutes, and then were 
allowed to hybridize for 8 hrs at 68°C. The two hybridizations were then mixed together with an additional 1 ul of 
fresh denatured driver cDNA and were allowed to hybridize overnight at 68°C. The second hybridization was then 
diluted in 200 ul of 20 mM Hepes, pH 8.3, 50 mM NaCl, 0.2 mM EDTA, heated at 70°C for 7 min. and stored at - 
20°C. 

PCR Amplification. Cloning and Sequencing of Gene Fragments Generated from SSH: 
To amplify gene fragments resulting from SSH reactions, two PCR amplifications were performed. In the 
primary PCR reaction 1 ul of the diluted final hybridization mix was added to 1 uJ of PCR primer 1 (10 uM), 0.5 ul 
dNTP mix (10 uM), 2.5 ul 10 x reaction buffer (CLONTECH) and 0.5 ul 50 x Advantage cDNA polymerase Mix 
(CLONTECH) in a final volume of 25 ul. PCR 1 was conducted using the following conditions: 75°C for 5 min., 
94°C for 25 sec, then 27 cycles of 94°C for 10 sec, 66°C for 30 sec, 72°C for 1 .5 min. Five separate primary PCR 
reactions were performed for each experiment. The products were pooled and diluted 1:10 with water. For the 
secondary PCR reaction, 1 ul from the pooled and diluted primary PCR reaction was added to the same reaction 
mix as used for PCR 1 , except that primers NP1 and NP2 (10 uM) were used instead of PCR primer 1 . PCR 2 was 
performed using 10-12 cycles of 94°C for 10 sec, 68°C for 30 sec, and 72°C for 1 .5 minutes. The PCR products 
were analyzed using 2% agarose gel electrophoresis. 

The PCR products were inserted into pCR2.1 using the T/A vector cloning kit (lnvitrogen). Transformed 
E. coli were subjected to blue/white and ampicillin selection. White colonies were picked and arrayed into 96 well 
plates and were grown in liquid culture overnight. To identify inserts, PCR amplification was performed on 1 ml of 
bacterial culture using the conditions of PCR1 and NP 1 and NP2 as primers. PCR products were analyzed using 2% 
agarose gel electrophoresis. 

Bacterial clones were stored in 20% glycerol in a 96 well format. Plasmid DNA was prepared, sequenced, 
and subjected to nucleic acid homology searches of the GenBank, dBest, and NCI-CGAP databases. 
RT-PCR Expression Analysis : 
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First strand cDNAs can be generated from 1 \ig of mRNA with oligo (dT)12-18 priming using the 
Gibco-BRL Superscript Preamplification system. The manufacturer's protocol was used which included an 
incubation for 50 min at 42°C with reverse transcriptase followed by RNAse H treatment at 37°C for 20 min. After 
completing the reaction, the volume can be increased to 200 ul with water prior to normalization. 
5 Normalization of the first strand cDNAs from multiple tissues was performed by using the primers 

5'atatcgccgcgctcgtcgtcgacaa 3' (SEQ ID NO: 38332) and 5'agccacacgcagctcattgtagaagg 3' (SEQ ID NO: 3972^) to 
amplify P-actin. First strand cDNA (5 ul) were amplified in a total volume of 50 uJ containing 0.4 uM primers, 0.2 
uM each dNTPs, 1XPCR buffer (Clontech, 10 mM Tris-HCL, 1.5 mM MgCl 2 , 50 mM KC1, pH8.3) and IX Klentaq 
DNA polymerase (Clontech). Five ul of the PCR reaction can be removed at 18, 20, and 22 cycles and used for 

10 agarose gel electrophoresis. PCR was performed using an MJ Research thermal cycler under the following 

conditions: Initial denaturation can be at 94°C for 15 sec, followed by a 18, 20, and 22 cycles of 94°C for 15, 65°C 
for 2 min, 72°C for 5 sec. A final extension at 72°C was carried out for 2 min. After agarose gel electrophoresis, 
the band intensities of the 283 bp P-actin bands from multiple tissues were compared by visual inspection. Dilution 
factors for the first strand cDN As were calculated to result in equal p-actin band intensities in all tissues after 22 

15 cycles of PCR. Three rounds of normalization can be required to achieve equal band intensities in all tissues after 
22 cycles of PCR. 

To determine expression levels of the 121 PI Fl gene, 5 ul of normalized first strand cDNA were analyzed 
by PCR using 26, and 30 cycles of amplification. Semi-quantitative expression analysis can be achieved by 
comparing the PCR products at cycle numbers that give light band intensities. 
20 A typical RT-PCR expression analysis is shown in Figure 17. RT-PCR expression analysis was performed on first 
strand cDNAs generated using pools of tissues from multiple samples. The cDNAs were shown to be normalized 
using beta-actin primers. PCR Expression was observed in human testis, prostate cancer xenografts, colon cancer 
tissue pools, lung cancer tissue pools, kidney cancer tissue pools, bladder cancer tissue pools, and prostate cancer 
tissue pools. 

25 

Example 2: Full Length Cloning of 121P1F1 and Homology Comparis on to Known Sequences 

To isolate genes that are involved in the progression of androgen dependent (AD) prostate cancer to 
androgen independent (AI) cancer, an experiment was conducted with the LAPC-9AD xenograft in male SCID 
mice. Mice that harbored LAPC-9AD xenografts were castrated when the tumors reached a size of 1 cm in 
30 diameter. The tumors regressed in size and temporarily stopped producing the androgen dependent protein PSA. 
Seven to fourteen days post-castration, PSA levels were detectable again in the blood of the mice. Eventually the 
tumors develop an AI phenotype and start growing again in the castrated males. Tumors were harvested at different 
time points after castration to identify genes that are turned on or off during the transition to androgen 
independence. 

35 The gene 121P1F1 was derived from an LAPC-9AD minus LAPC-9AD (28 days post-castration) 

subtraction. The SSH DNA sequence of 254 bp (Figure I) is novel and did not exhibit significant homology to any 
known human genes in public databases. 

A cDNA (clone A) of 863 bp was isolated from a Human Testis cDNA library, revealing an ORF of 205 
amino acids (Figure 2 and Figure 3). It is probable that 121 PI Fl is a cytoplasmic protein based on two topology 

40 algorithms (J. Mol. Biol. 2000, 300: 1005 and Bioinformatics, 1998, 14:378) and based on its homology to 
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Dynactin. However, it is also possible that 121P1F1 is localized in the nucleus based on PSORT analysis 
(http://psort.nibb.ac.jp:8800/form.html). 

Sequence analysis of 121P1F1 reveals highest homology to human GAJ protein (Figure 4C); the two 
proteins are 100% homologous over a 205 amino acid region. 121P1F1 also displays homology to a mouse putative 
5 protein (Figure 4D). The two proteins are 89% identical over a 202 amino acid region. Also, 121P1F1 shows 40% 
identity over a 202 amino acid region with the 24.2kDa hypothetical coiled-coil protein of fission yeast. (Figure 4E) 

The 121P1F1 cDNA was deposited on Mar 1, 2001 with the American Type Culture Collection (ATCC; 
Manassas, VA), and has been assigned Accession No. PTA-3139. 

10 Example 3: Chromosomal Localization 

Chromosomal localization can implicate genes in disease pathogenesis. Several chromosome mapping 
approaches are available, including fluorescent in situ hybridization (FISH), human/hamster radiation hybrid (RH) 
panels (Walter et aL, 1994; Nature Genetics 7:22; Research Genetics, Huntsville Al), human-rodent somatic cell 
hybrid panels such as is available from the Coriell Institute (Camden, New Jersey), and genomic viewers utilizing 
15 BLAST homologies to sequenced and mapped genomic clones (NCBI, Bethesda, Maryland). 

121P1F1 maps to chromosome 4q, using 121P1F1 sequence and the NCBI BLAST tool located on the 
World Wide Web afc ( http://www .ncbi.nlm.nih.gov/genome/seq/page.cgi?F=HsBlast.html&&ORG=Hs). 

20 Example 4: Expression Analysis of 121P1F1 in Normal Tissues an d Patient Specimens 

Expression analysis by RT-PCR demonstrated that 121P1F1 expression is reminiscent of a cancer-testis 

gene (Figure 17A). Normal tissue expression is restricted to testis and, to a lower extent, it is detected in the 

thymus and ovary. Analysis of human patient cancer RNA pools shows expression in prostate, kidney, and bladder 

cancers, as well as lung cancers (Figure 17B). 
25 Extensive Northern blot analysis of 121P1F1 in 16 human normal tissues confirmed the expression 

observed by RT-PCR (Figure 18). A 1.2 kb transcript was detected in testis and at lower levels in thymus. 

121P1F1 expression was also shown in prostate cancer xenografts and in all cancer cell lines tested, such as in 

prostate (LAPC 4AD, LAPC 4AI, LAPC 9 AD, LAPC 9AI, LNCaP, PC-3, DU145 Tsu-Prl, and LAPC4); bladder 

(HT1 197, SCaBER, UM-UC-3, TCCSUP, J82, 5637), lung (A427, NCI-H82, NCI-H146), kidney (769-P, A-498, 
30 CAKI-1, SW 839), pancreas (PANC-1, Bx PC-3, HPAC, Capan-1); colon (SK-CO-1, Caco-2, LoVo, T84, Colo205) 

and in the cancer cell lines 293T, RD-ES and KCL22.(Figure 19). These results indicated that 121P1F1 is a testis 

specific gene that is upregulated in cancers. 

Northern blot analysis showed that 121P1F1 is expressed in prostate tumor tissues derived from prostate 

cancer patients (Figure 20). It was also expressed in kidney, cervix, breast and stomach patient cancer samples 
35 (Figure 21). The expression detected in normal adjacent tissues (isolated from diseased tissues) but not in normal 

tissues, isolated from healthy donors, indicate that these tissues are not fully normal and that 121P1F1 is expressed 

in early stage tumors, and thus can be used as a diagnostic target. 

Since 121 PI Fl was derived from a LAPC-9 AD minus LAPC-9 AD (28 days post-castration) subtraction, 

an assay was performed for androgen regulation of 1 2 1 P 1 F 1 (Figure 22). LAPC-4 cells were grown in charcoal- 
40 stripped medium and stimulated with the synthetic androgen mibolerone, for either 14 or 24 hours. It was shown 

that the expression of 121 PI FI went down in absence of normal serum, and is modulated in presence of 

mibolerone, 24 hours after stimulation. The experimental samples were confirmed by testing for the expression of 

the androgen-regulated prostate cancer gene TMPRSS2. This experiment showed that, as expected, TMPRSS2 
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levels go down in presence of charcoal-stripped serum, and expression is induced at 14 and 24 hours in presence of 
mibolerone. 

Figure 15 shows androgen regulation of 121P1F1 in vivo. Male mice were injected with LAPC-9AD 
tumor cells. When tumor reached a palpable size (0.3-0.5cm in diameter), mice were castrated and tumors 

5 harvested at different time points following castration. RNA was isolated from the xenograft tissues. Northern 
blots with 10 ug of total RN A/lane were probed with the 121 PI Fl SSH fragment; size standards in kilobases (kb) 
are indicated on the side. Results show that expression of 121 PI Fl is slightly downregulated 7 days after 
castration. The protein TMPRSS2 was used as a positive control. A picture of the ethidium-bromide staining of the 
RNA gel is also presented (lowest panel). 

10 1 2 1 P 1 F 1 expression is reminiscent of a cancer-testis gene. Its restricted normal tissue expression and the 

upregulation detected in prostate cancer, bladder cancer, kidney cancer, colon cancer, and lung cancer, indicate that 
121P1F1 is therapeutic and prophylactic target and a diagnostic and prognostic marker for human cancers. 



15 Example 5: Splice Variants of 121P1F1 and Single Nucleotide Polymorphisms 

Splice Variants 

Splice variants are alternatively spliced transcripts. When a gene is transcribed from genomic DNA, the 
initial RNA is generally spliced to produce functional mRNA, which has only exons and is used for translation into 
an amino acid sequence. Accordingly, a given gene can have zero to many alternatively spliced mRNA products. 

20 Alternative transcripts each have a unique exon makeup, and can have different coding and/or non-coding (5' or 3' 
end) portions, from the original transcript. Alternative transcripts can code for similar proteins with the same or a 
similar function or may encode proteins with different functions, and may be expressed in the same tissue at the 
same time, or at different tissue at different times. Proteins encoded by alternative transcripts can have similar or 
different cellular or extracellular localizations, e.g., be secreted. 

25 Splice variants are identified by a variety of art-accepted methods. For example, splice variants are 

identified by use of EST data. First, all human ESTs were grouped into clusters which show direct or indirect 
identity with each other. Second, ESTs in the same cluster were further grouped into sub-clusters and assembled 
into a consensus sequence. The starting gene is compared to the consensus sequence(s). Each consensus sequence 
is a potential splice variant for that gene (see, e.g., Web URL located on the World Wide Web at 

30 www.doubletwist.com/products/cl l_agentsOverview.jhtml). Even when a variant is identified that is not a full- 
length clone, that portion of the variant is very useful for antigen generation and for further cloning of the full- 
length splice variant, using techniques known in the art. 

Moreover, computer programs are available in the art that identify splice variants based on genomic 
sequences. Genomic-based variant identification programs include FgenesH (A. Salamov and V. Solovyev, "Ab 

35 initio gene finding in Drosophila genomic DNA," Genome Research. 2000 April; 10(4):5 16-22); Grail (Web URL 
compbio.ornl.gov/Grail-bin/EmptvGrailForm ) and GenScan (Web URL genes.mit.edu/GENSCAN.html). For a 
general discussion of splice variant identification protocols see., e.g., Southan C., "A genomic perspective on 
human proteases," FEBS Lett. (2001 Jun 8) 498(2-3):214-8; and de Souza SJ, et al., "Identification of human 
chromosome 22 transcribed sequences with ORF expressed sequence tags," Proc. Natl. Acad. Sci. USA. (2000 Nov 

40 7) 97(23): 12690-3. 

For variants identified by the EST-based method, Table XXII shows the nucleotide sequences of the splice 
variants. Table XXIII shows the alignment of the splice variant with the 121 PI Fl nucleic acid sequence. Table 
XXIV displays alignments of an amino acid sequence encoded by a splice variant with 121P1F1 v. 1. Table XXV 
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lays out the amino acid translation of the splice variant for the identified reading frame orientation. Tables XXII 
through XXV are set forth herein on a variant-by-variant basis. 

For variants identified by any one of the genomic sequence-based methods, Table XXII shows the 
nucleotide sequences of the splice variant. Table XXIII shows the alignment of the splice variant with the 121P1F1 
5 nucleic acid sequence. Table XXIV displays the alignment of amino acid sequence of the predicted transcripts with 
121P1F1. The genomic-based computer programs predict a transcript from genomic sequence, and not only predict 
exons but also set open reading frame as the first forward open reading frame. The predicted transcript does not 
contain 5' or 3' untranslated region (UTR). It starts with ATG and ends with a stop codon, TAG, TGA or TAA. In 
case the transcript is predicted on the reverse strand of the genomic sequence, the sequence of the transcript is 

10 reverse-complemented to the genomic sequence of the exons. Thus, the genomic-based programs provide the 
correct transcript sequence, with 5' to 3* orientation and +1 as the open reading frame. However, due to the 
possibility of inaccurate prediction of exons or the possibility of sequencing errors in genomic data, other peptides 
in other forward open reading frames can also be encoded by the variant. 

To further confirm the parameters of a splice variant, a variety of techniques are available in the art, such 

15 as full-length cloning, proteomic validation, PCR-based validation, and 5' RACE validation, etc. (see e.g., 

Proteomic Validation: Brennan SO, Fellowes AP, George PM.; "Albumin banks peninsula: a new termination 
variant characterised by electrospray mass spectrometry." Biochim Biophys Acta. 1999 Aug 17;1433(l-2):321-6; 
Ferranti P, et al., "Differential splicing of pre-messenger RNA produces multiple forms of mature caprine 
alpha(sl)-casein." Eur J Biochem. 1997 Oct 1;249(1): 1-7; PCR-based Validation: Wellmann S, et al., "Specific 

20 reverse transcription-PCR quantification of vascular endothelial growth factor (VEGF) splice variants by 

LightCycler technology." Clin Chem. 2001 Apr;47(4):654-60; Jia HP, et al., Discovery of new human beta- 
defensins using a genomics-based approach," Gene. 2001 Jan 24;263(1-2):21 1-8; PCR-based and 5' RACE 
Validation: Brigle KE, et al., "Organization of the murine reduced folate carrier gene and identification of variant 
splice forms," Biochim Biophys Acta. 1997 Aug 7; 1353(2): 191-8. 

25 It is known in the art that genomic regions are modulated in cancers. When the genomic region to which 

121P1F1 maps is modulated in a particular cancer, the splice variants of 121 PI Fl are modulated as well. Disclosed 
herein is that 121P1F1 has a particular expression profile. Splice variants of 121 PI Fl that are structurally and/or 
functionally similar to 121 PI Fl share this expression pattern, thus serving as tumor-associated markers/antigens. 
Using the EST assembly approach, we identified four splice variants. They were designated as splice 

30 variant 1 to 4. Splice variant 1 has two potential open reading frames and thus two potential translated peptide 
sequences, designated as 1 A and IB. 

Single Nucleotide Polymorphisms (SNPs) 

A Single Nucleotide Polymorphism (SNP) is a single base pair variation in a nucleotide sequence. As 
35 appreciated by those in the art, in a single nucleotide cange in a codon can cuse the codon to encode a different 

amino acid. Thus a SNP can change amino acids of the protein encoded by the gene and thus change the functions 
of the protein. Some SNPs cause inherited diseases and some others contribute to quantitative variations in 
phenotype and reactions to environmental factors including diet and drugs among individuals. Therefore, the 
occurance of one or more SNPs is relevant in many contexts, including but not limited to diagnosis of inherited or 
40 acquired disease, determination of drug reactions and dosage, identification of genes responsible for disearse and 
discovery of the genetic relationship between individuals (P. Nowotny, J. M. Kwon and A. M. Goate, " SNP 
analysis to dissect human traits," Curr. Opin. Neurobiol. 2001 Oct; 1 1(5):637-641; M. Pirmohamed and B. K. Park, 
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"Genetic susceptibility to adverse drug reactions," Trends Pharmacol. ScL 2001 Jun; 22(6):298-305; J. H. Riley, C. 
J. Allan, E. Lai and A. Roses, "The use of single nucleotide polymorphisms in the isolation of common disease 
genes," Pharmacogenomics. 2000 Feb; l(l):39-47; R. Judson, J. C. Stephens and A. Windemuth, "The predictive 
power of haplotypes in clinical response," Pharmacogenomics. 2000 Feb; l(l):15-26). 

5 SNPs are identified by a variety of art-accepted methods (P. Bean, "The promising voyage of SNP target 

discovery," Am. Clin. Lab. 2001 Oct-Nov; 20(9): 18-20; K. M. Weiss, "In search of human variation," Genome Res. 
1998 Jul; 8(7):69 1-697; M. M. She, "Enabling large-scale pharmacogenetic studies by high-throughput mutation 
detection and genotyping technologies," Clin. Chem. 2001 Feb; 47(2): 164- 172). 

For example, SNPs are identified by sequencing DNA fragments that show polymorphism by gel-based 

10 methods such as restriction fragment length polymorphism (RFLP) and denaturing gradient gel electrophoresis 

(DGGE). SNPs can also be discovered by direct sequencing of DNA samples pooled from different individuals or 
by comparing sequences from different DNA samples. With the accumulation of sequence data in public and 
private databases, one can also discover SNPs by comparing sequences using computer programs (Z. Gu, L. Hillier 
and P. Y. Kwok, "Single nucleotide polymorphism hunting in cyberspace," Hum. Mutat. 1998; 12(4):22 1-225). 

1 5 SNPs can be verified by a variety of methods including direct sequencing and high throughput microarrays (P. Y. 
Kwok, "Methods for genotyping single nucleotide polymorphisms," Annu. Rev. Genomics Hum. Genet. 2001; 
2:235-258; M. Kokoris, K. Dix, K. Moynihan, J. Mathis, B. Erwin, P. Grass, B. Hines and A. Duesterhoeft, "High- 
throughput SNP genotyping with the Masscode system," Mol. Diagn. 2000 Dec; 5(4):329-340). 

As disclosed herein SNPs are identified by directly sequencing cDNA clones and by comparing our 

20 sequences with public and proprietary sequences. By sequencing cDNA clones, SNPs are identified.. By 

comparing these sequences with high quality proprietary or public sequences (e.g, NCBl/GenBank, accesible atjhe 
World Wide Web ( URL www .ncbi.nlm.nih.gov), SNPs are identified. SNPs are identified by aligning variant 
sequences with NCBI genes and ESTs. Typically, only ESTs with over 97% identity are considered; differences 
within 50 base pairs of the ends are not considered. Only SNPs that occur twice from two independent sequences 

25 are included. 

Example 6: Production of Recombinant 121P1F1 in Prokarvotic Systems 

To express recombinant 121P1F1 in prokaryotic cells, the full or partial length 121P1F1 cDNA sequences 
can be cloned into any one of a variety of expression vectors known in the art. One or more of the following 

30 regions of 121 PI Fl are expressed in these constructs: amino acids 1 to 205 of 121 P1F1; amino acids 1-126 of 

splice variant la; amino acids 1-119 of splice variant lb; amino acids 1-122 of splice variant 2; amino acids 1-190 
of splice variant 3; amino acids 1-190 of splice variant 4, or any 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,20,21, 
22, 23, 24, 25, 26, 27, 28, 29, 30 or more contiguous amino acids from 121 PI Fl, splice variants, or analogs thereof. 
A. In vitro transcription and translation constructs: 

35 pCRII: To generate 121P1F1 sense and anti-sense RNA probes for RNA in situ investigations, pCRII 

constructs (Invitrogen, Carlsbad CA) are generated encoding either all or fragments of the 121 PI Fl cDNA. The 
pCRII vector has Sp6 and T7 promoters flanking the insert to drive the transcription of 1 2 1 P 1 F 1 RNA for use as 
probes in RNA in situ hybridization experiments. These probes are used to analyze the cell and tissue expression of 
121 PI Fl at the RNA level. Transcribed 121P1F1 RNA representing the cDNA amino acid coding region of the 

40 1 2 1 P 1 F 1 gene is used in in vitro translation systems such as the TnT™ Coupled Reticulolysate Sytem (Promega, 
Corp., Madison, WI) to synthesize 121P1F1 protein. 
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B. Bacterial Constructs: 

pGEX Constructs: To generate recombinant 121P1F1 proteins in bacteria that are fused to the Glutathione 
S-transferase (GST) protein, all or parts of the 121P1F1 cDNA protein coding sequence are fused to the GST gene 
by cloning into pGEX-6P-l or any other GST- fusion vector of the pGEX family (Amersham Pharmacia Biotech, 
Piscataway, NJ). These constructs allow controlled expression of recombinant 121P1F1 protein sequences with 
GST fused at the amino-terminus and a six histidine epitope (6X His) at the carboxyl-terminus. The GST and 6X 
His tags permit purification of the recombinant fusion protein from induced bacteria with the appropriate affinity 
matrix and allow recognition of the fusion protein with anti-GST and anti-His antibodies. The 6X His tag is 
generated by adding 6 histidine codons to the cloning primer at the 3' end, e.g., of the open reading frame (ORF). 
A proteolytic cleavage site, such as the PreScission™ recognition site in pGEX-6P-l, can be employed such that it 
permits cleavage of the GST tag from 121 PI Fl -related protein. The ampicillin resistance gene and pBR322 origin 
permits selection and maintenance of the pGEX plasmids in E. coli. In one embodiment, amino acids 1- 1 14 of 
121P1F1 is cloned into the pGEX-6P-l vector, expressed in bacteria, purified, and a 121PlFlcleavage product 
generated utilizing PreScission protease. 

pMAL Constructs: To generate, in bacteria, recombinant 121P1F1 proteins that are fused to maltose- 
binding protein (MBP), all or parts of the 121 PI Fl cDNA protein coding sequence are fused to the MBP gene by 
cloning into the pMAL-c2X and pMAL-p2X vectors (New England Biolabs, Beverly, MA). These constructs allow 
controlled expression of recombinant 121P1F1 protein sequences with MBP fused at the amino-terminus and a 6X 
His epitope tag at the carboxyl-terminus. The MBP and 6X His tags permit purification of the recombinant protein 
from induced bacteria with the appropriate affinity matrix and allow recognition of the fusion protein with anti- 
MBP and anti-His antibodies. The 6X His epitope tag is generated by adding 6 histidine codons to the 3' cloning 
primer. A Factor Xa recognition site permits cleavage of the pMAL tag from 12 1 PI Fl . The pMAL-c2X and 
pMAL-p2X vectors are optimized to express the recombinant protein in the cytoplasm or periplasm respectively. 
Periplasm expression enhances folding of proteins with disulfide bonds. 

pET Constructs : To express 121P1F1 in bacterial cells, all or parts of the 121P1F1 cDNA protein coding 
sequence are cloned into the pET family of vectors (Novagen, Madison, WI). These vectors allow tightly 
controlled expression of recombinant 121P1F1 protein in bacteria with and without fusion to proteins that enhance 
solubility, such as NusA and thioredoxin (Trx), and epitope tags, such as 6X His and S-Tag ™ that aid purification 
and detection of the recombinant protein. For example, constructs are made utilizing pET NusA fusion system 43.1 
such that regions of the 121 PI Fl protein are expressed as amino-terminal fusions to NusA. 
C. Yeast Constructs: 

pESC Constructs : To express 12 1 PI Fl in the yeast species Saccharomyces cerevisiae for generation of 
recombinant protein and functional studies, all or parts of the 121 PI Fl cDNA protein coding sequence are cloned 
into the pESC family of vectors each of which contain 1 of 4 selectable markers, HIS3, TRP1, LEU2, and URA3 
(Stratagene, La Jolla, CA). These vectors allow controlled expression from the same plasmid of up to 2 different 
genes or cloned sequences containing either Flag™ or Myc epitope tags in the same yeast cell. This system is 
useful to confirm protein-protein interactions of 121 PI Fl. In addition, expression in yeast yields similar post- 
translational modifications, such as glycosylations and phosphorylations, that are found when expressed in 
eukaryotic cells. 

pESP Constructs : To express 121P1F1 in the yeast species Saccharomyces pombe, all or parts of the 
121P1F1 cDNA protein coding sequence are cloned into the pESP family of vectors. These vectors allow 
controlled high level of expression of a 121P1F1 protein sequence that is fused at either the amino terminus or at 
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the carboxyl terminus to GST which aids purification of the recombinant protein. A Flag™ epitope tag allows 
detection of the recombinant protein with anti- Flag™ antibody. 

5 Example 7: Production of Recombinant 121P1F1 in Eukarvotic Systems 

A. Mammalian Constructs: 

One or more of the following regions of 121P1F1 are expressed in these constructs: amino acids 1 to 205 
of 121 P1F1; amino acids 1-126 of splice variant la; amino acids 1-1 19 of splice variant lb; amino acids 1-122 of 
splice variant 2; amino acids 1-190 of splice variant 3; amino acids 1-190 of splice variant 4, or any 8, 9, 10, 1 1, 12, 

10 13, 14, 15, 16, 17, 18, 19,20,21,22,23, 24, 25, 26, 27, 28, 29, 30 or more contiguous amino acids from 121P1F1, 
splice variants, or analogs thereof. In certain embodiments a region of 121 PI Fl is expressed that encodes an amino 
acid not shared amongst at least two variants. 

The constructs can be transfected into any one of a wide variety of mammalian cells such as 293T cells. 
Transfected 293T cell lysates can be probed with the anti-121PlFl polyclonal serum, described herein. 

1 5 pcDNA4/HisMax Constructs: To express 1 2 1 P 1 F 1 in mammalian cells, a 1 2 1 P 1 F 1 ORF, or portions 

thereof, of 121 PI Fl are cloned into pcDN A4/H isMax Version A (Invitrogen, Carlsbad, CA). Protein expression is 
driven from the cytomegalovirus (CMV) promoter and the SP16 translational enhancer. The recombinant protein 
has Xpress™ and six histidine (6X His) epitopes fused to the amino-terminus. The pcDNA4/HisMax vector also 
contains the bovine growth hormone (BGH) polyadenylation signal and transcription termination sequence to 

20 enhance mRNA stability along with the SV40 origin for episomal replication and simple vector rescue in cell lines 
expressing the large T antigen. The Zeocin resistance gene allows for selection of mammalian cells expressing the 
protein and the ampicillin resistance gene and ColEl origin permits selection and maintenance of the plasmid in £. 
coli. 

pcDNA3.1/IVlvcHis Constructs: To express 121P1F1 in mammalian cells, a 121P1F1 ORF, or portions 

25 thereof, of 121 PI Fl with a consensus Kozak translation initiation site is cloned into pcDNA3.1/MycHis Version A 
(Invitrogen, Carlsbad, CA). Protein expression is driven from the cytomegalovirus (CMV) promoter. The 
recombinant proteins have the myc epitope and 6X His epitope fused to the carboxyl-terminus. The 
pcDNA3.1/MycHis vector also contains the bovine growth hormone (BGH) polyadenylation signal and 
transcription termination sequence to enhance mRNA stability, along with the SV40 origin for episomal replication 

30 and simple vector rescue in cell lines expressing the large T antigen. The Neomycin resistance gene can be used, as 
it allows for selection of mammalian cells expressing the protein and the ampicillin resistance gene and ColEl 
origin permits selection and maintenance of the plasmid in E. coli. Figure 14 shows expression of 121P1F1 
pcDNA3.1/mychis in transiently infected 293T cells. 

PCDNA3.1/CT-GFP-TOPO Construct: To express 121P1F1 in mammalian cells and to allow detection 

35 of the recombinant proteins using fluorescence, a 1 2 1 P 1 F 1 ORF, or portions thereof, with a consensus Kozak 

translation initiation site are cloned into pcDNA3.1/CT-GFP-TOPO (Invitrogen, CA). Protein expression is driven 
from the cytomegalovirus (CMV) promoter. The recombinant proteins have the Green Fluorescent Protein (GFP) 
fused to the carboxyl-terminus facilitating non-invasive, in vivo detection and cell biology studies. The 
pcDNA3.1CT-GFP-TOPO vector also contains the bovine growth hormone (BGH) polyadenylation signal and 

40 transcription termination sequence to enhance mRNA stability along with the SV40 origin for episomal replication 
and simple vector rescue in cell lines expressing the large T antigen. The Neomycin resistance gene allows for 
selection of mammalian cells that express the protein, and the ampicillin resistance gene and ColEl origin permits 
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selection and maintenance of the plasmid in E. coli. Additional constructs with an amino-terminal GFP fusion are 
made in pcDNA3.1/NT-GFP-TOPO spanning the entire length of a 121P1F1 protein. 

PAPtag: A 121 PI Fl ORF, or portions thereof, is cloned into pAPtag-5 (GenHunter Corp. Nashville, TN). 
This construct generates an alkaline phosphatase fusion at the carboxyl-terminus of a 121P1F1 protein while fusing 
5 the IgGK signal sequence to the ami no-terminus. Constructs are also generated in which alkaline phosphatase with 
an amino-terminal IgGK signal sequence is fused to the amino-terminus of a 121 PI Fl protein. The resulting 
recombinant 121P1F1 proteins are optimized for secretion into the media of transfected mammalian cells and can 
be used to identify proteins such as ligands or receptors that interact with 121P1F1 proteins. Protein expression is 
driven from the CMV promoter and the recombinant proteins also contain myc and 6X His epitopes fused at the 

10 carboxyl-terminus that facilitates detection and purification. The Zeocin resistance gene present in the vector 
allows for selection of mammalian cells expressing the recombinant protein and the ampicillin resistance gene 
permits selection of the plasmid in E. coli. 

ptagS: A 121 PI Fl ORF, or portions thereof, is cloned into pTag-5. This vector is similar to pAPtag but 
without the alkaline phosphatase fusion. This construct generates 121 P1F1 protein with an amino-terminal IgGK 

15 signal sequence and myc and 6X His epitope tags at the carboxyl-terminus that facilitate detection and affinity 
purification. The resulting recombinant 121P1F1 protein is optimized for secretion into the media of transfected 
mammalian cells, and is used as immunogen or ligand to identify proteins such as ligands or receptors that interact 
with the 121P1F1 proteins. Protein expression is driven from the CMV promoter. The Zeocin resistance gene 
present in the vector allows for selection of mammalian cells expressing the protein, and the ampicillin resistance 

20 gene permits selection of the plasmid in E. coli. 

PsecFc: A 121 PI Fl ORF, or portions thereof, is also cloned into psecFc. The psecFc vector was 
assembled by cloning the human immunoglobulin Gl (IgG) Fc (hinge, CH2, CH3 regions) into pSecTag2 
(Invitrogen, California). This construct generates an IgGl Fc fusion at the carboxyl-terminus of the 121 PI Fl 
proteins, while fusing the IgGK signal sequence to N-terminus. 121P1F1 fusions utilizing the murine IgGl Fc 

25 region are also used. The resulting recombinant 121 PI Fl proteins are optimized for secretion into the media of 
transfected mammalian cells, and can be used as immunogens or to identify proteins such as ligands or receptors 
that interact with 121P1F1 protein. Protein expression is driven from the CMV promoter. The hygromycin 
resistance gene present in the vector allows for selection of mammalian cells that express the recombinant protein, 
and the ampicillin resistance gene permits selection of the plasmid in E. coli. 

30 pSRa Constructs: To generate mammalian cell lines that express 121P1F1 constitutively, 121P1F1 ORF, 

or portions thereof, of 121 PI Fl are cloned into pSRa constructs. Amphotropic and ecotropic retroviruses are 
generated by transfection of pSRa constructs into the 293T-10A1 packaging line or co-transfection of pSRa and a 
helper plasmid (containing deleted packaging sequences) into the 293 cells, respectively. The retrovirus is used to 
infect a variety of mammalian cell lines, resulting in the integration of the cloned gene, 121P1F1, into the host cell- 

35 lines. Protein expression is driven from a long terminal repeat (LTR). The Neomycin resistance gene present in the 
vector allows for selection of mammalian cells that express the protein, and the ampicillin resistance gene and 
ColEl origin permit selection and maintenance of the plasmid in £. coli. The retroviral vectors can thereafter be 
used for infection and generation of various cell lines using, for example, PC3, TMIH 3T3, TsuPrl, 293 or rat-1 cells. 
Additional pSRa constructs are made that fuse an epitope tag such as the FLAG™ tag to the carboxyl- 

40 terminus of 1 2 1 P 1 F 1 sequences to allow detection using anti-Flag antibodies. For example, the FLAG™ sequence 
5' gat tac aag gat gac gac gat aag 3' (SEQ ID NO: 40 ) is added to cloning primer at the 3' end of the ORF. 

69 



511582003440 



Additional pSRct constructs are made to produce both amino-terminal and carboxyl-terminal GFP and myc/6X His 
fusion proteins of the full-length 121P1F1 proteins. 

Additional Viral Vectors: Additional constructs are made for viral-mediated delivery and expression of 
121P1F1. High virus titer leading to high level expression of 121P1F1 is achieved in viral delivery systems such as 
5 adenoviral vectors and herpes amplicon vectors. A 1 2 1 P 1 F 1 coding sequences or fragments thereof are amplified 
by PCR and subcloned into the AdEasy shuttle vector (Stratagene). Recombination and virus packaging are 
performed according to the manufacturer's instructions to generate adenoviral vectors. Alternatively, 121P1F1 
coding sequences or fragments thereof are cloned into the HSV-1 vector (Imgenex) to generate herpes viral vectors. 
The viral vectors are thereafter used for infection of various cell lines such as PC3, NIH 3T3, 293 or rat-1 cells. 

10 Regulated Expression Systems: To control expression of 121 PI Fl in mammalian cells, coding 

sequences of 121 PI Fl, or portions thereof, are cloned into regulated mammalian expression systems such as the T- 
Rex System (Invitrogen), the GeneSwitch System (Invitrogen) and the tightly-regulated Ecdysone System 
(Sratagene). These systems allow the study of the temporal and concentration dependent effects of recombinant 
121P1F1. These vectors are thereafter used to control expression of 121P1F1 in various cell lines such as PC3, NIH 

15 3T3, 293 or rat-1 cells. 

B. Baculovirus Expression Systems 

To generate recombinant 121 P1F1 proteins in a baculovirus expression system, 121P1F1 ORF, or portions 
thereof, are cloned into the baculovirus transfer vector pBlueBac 4.5 (Invitrogen), which provides a His-tag at the 
N-terminus. Specifically, pBlueBac-121PlFl is co-transfected with helper plasmid pBac-N-Blue (Invitrogen) into 
20 SF9 (Spodoptera frugiperda) insect cells to generate recombinant baculovirus (see Invitrogen instruction manual for 
details). Baculovirus is then collected from cell supernatant and purified by plaque assay. 

Recombinant 121P1F1 protein is then generated by infection of HighFive insect cells (Invitrogen) with 
purified baculovirus. Recombinant 121P1F1 protein can be detected using anti-121PlFl or anti-His-tag antibody. 
121P1F1 protein can be purified and used in various cell-based assays or as immunogen to generate polyclonal and 
25 monoclonal antibodies specific for 1 2 1 P 1 F 1 . 

Example 8: Antigenicity Profiles and Secondary Structure 

Figure 5A, Figure 6A, Figure 7A, Figure 8A, and Figure 9A depict graphically five amino acid profiles of 
30 the 121P1F1 amino acid sequence; Figure 5B, Figure 6B, Figure 7B, Figure 8B, and Figure 9B depict graphically 

five amino acid profiles of the 12 IP 1F1 variant 1A amino acid sequence. Each assessment is available by accessing 
the ProtScale website located on the World Wide Web at (^Rt www.expasy.ch/cgi-bin/protscale. pi) on the ExPasy 
molecular biology server. 

These profiles: Figure 5, Hydrophilicity, (Hopp T.P., Woods K.R., 1981. Proc. Natl. Acad. Sci. U.S.A. 

35 78:3824-3828); Figure 6, Hydropathic ity, (Kyte J., Doolittle R.F., 1982. J. Mol. Biol. 157:105-132); Figure 7, 

Percentage Accessible Residues (Janin J., 1979 Nature 277:491-492); Figure 8, Average Flexibility, (Bhaskaran R., 
and Ponnuswamy P.K., 1988. Int. J. Pept. Protein Res. 32:242-255); Figure 9, Beta-turn (Deleage, G., Roux B. 1987 
Protein Engineering 1 :289-294); and optionally others available in the art, such as on the ProtScale website, were 
used to identify antigenic regions of the 121 PI Fl protein and variant 1 A. Each of the above amino acid profiles of 

40 121P1F1 were generated using the following ProtScale parameters for analysis: 1) A window size of 9; 2) 100% 
weight of the window edges compared to the window center; and 3) amino acid profile values normalized to lie 
between 0 and 1 . 
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Hydrophilicity (Figure 5), Hydropathicity (Figure 6) and Percentage Accessible Residues (Figure 7) 
profiles were used to determine stretches of hydrophilic amino acids (i.e., values greater than 0.5 on the 
Hydrophilicity and Percentage Accessible Residues profile, and values less than 0.5 on the Hydropathicity profile). 
Such regions are likely to be exposed to the aqueous environment, be present on the surface of the protein, and thus 
5 available for immune recognition, such as by antibodies. 

Average Flexibility (Figure 8) and Beta-turn (Figure 9) profiles determine stretches of amino acids (i.e., 
values greater than 0.5 on the Beta-turn profile and the Average Flexibility profile) that are not constrained in 
secondary structures such as beta sheets and alpha helices. Such regions are also more likely to be exposed on the 
protein and thus accessible to immune recognition, such as by antibodies. 

10 Antigenic sequences of the full length 121P1F1 protein indicated, e.g., by the profiles set forth in Figure 

5A, Figure 6A, Figure 7A, Figure 8A, and/or Figure 9A are used to prepare immunogens, either peptides or nucleic 
acids that encode them, to generate therapeutic and diagnostic anti-121PlFl antibodies. Antigenic sequences of the 
121P1F1 variant 1 A protein indicated, e.g., by the profiles set forth in Figure 5B, Figure 6B, Figure 7B, Figure 8B, 
and/or Figure 9B are used to prepare immunogens, either peptides or nucleic acids that encode them, to generate 

15 therapeutic and diagnostic anti-121PlFl-variant 1 A antibodies. The immunogen can be any 5, 6, 7, 8, 9, 10, 1 1, 12, 
13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50 or more than 50 contiguous amino acids, or the 
corresponding nucleic acids that encode them, from the 121 PI Fl protein or from variants la, lb, 2, 3, or 4 (see 
Figures 2 and 3). In particular, peptide immunogens of the invention can comprise, a peptide region of at least 5 
amino acids of Figure 2 in any whole number increment up to 205 that includes an amino acid position having a 

20 value greater than 0.5 in the Hydrophilicity profile of Figure 5; a peptide region of at least 5 amino acids of Figure 2 
in any whole number increment up to 205 that includes an amino acid position having a value less than 0.5 in the 
Hydropathicity profile of Figure 6; a peptide region of at least 5 amino acids of Figure 2 in any whole number 
increment up to 205 that includes an amino acid position having a value greater than 0.5 in the Percent Accessible 
Residues profile of Figure 7; a peptide region of at least 5 amino acids of Figure 2 in any whole number increment 

25 up to 205 that includes an amino acid position having a value greater than 0.5 in the Average Flexibility profile on 
Figure 8; and, a peptide region of at least 5 amino acids of Figure 2 in any whole number increment up to 205 that 
includes an amino acid position having a value greater than 0.5 in the Beta-turn profile of Figure 9. Peptide 
immunogens of the invention can also comprise nucleic acids that encode any of the forgoing. In addition, peptide 
immunogens can comprise amino acids of variant la, that contain characteristics of the above mentioned parameters 

30 set forth in Figure 5B, Figure 6B, Figure 7B, Figure 8B, or Figure 9B. 

All immunogens of the invention, peptide or nucleic acid, can be embodied in human unit dose form, or 
comprised by a composition that includes a pharmaceutical excipient compatible with human physiology. 

The secondary structure of 121 PI Fl, namely the predicted presence and location of alpha helices, extended 
strands, and random coils, is predicted from the primary amino acid sequence using the HNN - Hierarchical Neural 

35 Network method (Guermeur, 1997, Web URL pbil.ibcp.fr/cgi-bin/npsa_automat.pl?page=npsa_nn.html), accessed 
from the ExPasy molecular biology serve r located on the World Wide Web at ( Wob URL www .expasy.ch/tools/). 
The analysis indicates that 121P1F1 is composed 61.95% alpha helix, 1 .95% extended strand, and 36.10% random 
coil (Figure 16A). The secondary structure of variant la is presented in Figure 16B. 

Analysis of 121 PI Fl using a variety of transmembrane prediction algorithms accessed from the ExPasy 

40 molecular biology serve r located on the World Wide Web at ( Wob URL www.expasy.ch/tools/) did not predict the 
presence of such domains, suggesting that 121P1F1 and the variants are soluble proteins. 
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Example 9: Generation of 121P1F1 Polyclonal Antibodies 

Polyclonal antibodies can be raised in a mammal, for example, by one or more injections of an immunizing 
agent and, if desired, an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal 
by multiple subcutaneous or intraperitoneal injections. In addition to immunizing with the full length 12 1 PI Fl 
protein, computer algorithms are employed in design of immunogens that, based on amino acid sequence analysis, 
contain characteristics of being antigenic and available for recognition by the immune system of the immunized 
host (see the Example entitled "Antigenicity Profiles"). Such regions would be predicted to be hydrophilic, 
flexible, in beta-turn conformations, and be exposed on the surface of the protein (see, e.g., Figure 5A, Figure 6A, 
Figure 7A, Figure 8A, or Figure 9A for amino acid profiles that indicate such regions of 121P1F1; and Figure 5B, 
Figure 6B, Figure 7B, Figure 8B, or Figure 9B for amino acid profiles that indicate such regions of 121 PI Fl variant 
la). 

For example, 121P1F1 recombinant bacterial fusion proteins or peptides containing hydrophilic, flexible, 
beta-turn regions of 121 PI Fl or of the variants are used as antigens to generate polyclonal antibodies in New 
Zealand White rabbits. For example, such regions include, but are not limited to, amino acids 1-50 and amino acids 
90-160 of 121 PI Fl. In addition, immunogens are designed to encode regions that are novel to particular variants of 
121P1F1, such as amino acids 93-126 of variant la, amino acids 1-6 of variant lb, and amino acids 1 17-122 of 
variant 2. Antibodies to these regions are useful to distinguish between 12 1 PI Fl and its splice variants. It is useful 
to conjugate the immunizing agent to a protein known to be immunogenic in the mammal being immunized. 
Examples of such immunogenic proteins include, but are not limited to, keyhole limpet hemocyanin (KLH), serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. In one embodiment, a peptide encoding amino acids 
1-25 of 121P1F1 is conjugated to KLH and used to immunize the rabbit. Alternatively the immunizing agent can 
include all or portions of the 121 PI Fl protein, analogs or fusion proteins thereof. For example, the 121 PI Fl amino 
acid sequence can be fused using recombinant DNA techniques to any one of a variety of fusion protein partners 
that are well known in the art, such as glutathione-S-transferase (GST) and HIS tagged fusion proteins. Such fusion 
proteins are purified from induced bacteria using the appropriate affinity matrix. 

In one embodiment, a GST-fusion protein encoding amino acids 1-114 of 121P1F1 coding sequence is 
produced, purified, and a proteolytic cleavage product in which GST sequences are removed is used as immunogen. 
Other recombinant bacterial fusion proteins that can be employed include maltose binding protein, LacZ, 
thioredoxin, NusA, or an immunoglobulin constant region (see the section entitled "Production of 121 PI Fl in 
Prokaryotic Systems" and Current Protocols In Molecular Biology, Volume 2, Unit 16, Frederick M. Ausubul et al. 
eds., 1995; Linsley, P.S., Brady, W., Urnes, M., Grosmaire, L., Damle, N., and Ledbetter, L.(1991) J.Exp. Med. 
174, 561-566). 

In addition to bacterial derived fusion proteins, mammalian expressed protein antigens are also used. 
These antigens are expressed from mammalian expression vectors such as the Tag5 and Fc-fusion vectors (see the 
section entitled "Production of Recombinant 121 PI Fl in Eukaryotic Systems"), and retain post-trans lational 
modifications such as glycosylations found in native protein. In one embodiment, the entire 121P1F1 coding 
sequence is cloned into the Tag5 mammalian secretion vector. The recombinant protein is purified by metal chelate 
chromatography from tissue culture supernatants of 293T cells stably expressing the recombinant vector. The 
purified Tag5 121P1F1 protein is then used as immunogen. 
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During the immunization protocol, it is useful to mix or emulsify the antigen in adjuvants that enhance the 
immune response of the host animal. Examples of adjuvants include, but are not limited to, complete Freund's 
adjuvant (CFA) and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

In a typical protocol, rabbits are initially immunized subcutaneously with up to 200 ug, typically 100-200 
5 jig, of fusion protein or peptide conjugated to KLH mixed in complete Freund's adjuvant (CFA). Rabbits are then 
injected subcutaneously every two weeks with up to 200 ug, typically 100-200 jig, of the immunogen in incomplete 
Freund's adjuvant (IFA). Test bleeds are taken approximately 7-10 days following each immunization and used to 
monitor the titer of the antiserum by ELISA. 

The reactivity of serum from immunized animals is tested by various immunoassays, such as ELISA, 

10 Western blot, immunofluorescence microscopy, and flow cytometry. The reactivity of the ant i -GST-cleavage 

product serum was tested by Western blot using various amounts of immunogen; see Figure 12, which shows strong 
and specific reactivity of the serum to the cleavage antigen. Antiserum is then purified by various affinity 
chromatography techniques. 

The anti-serum from the GST-fusion cleavage immunogen is affinity purified by passage over a column 

1 5 composed of the GST-cleavage antigen covalently coupled to Afflgel matrix (BioRad, Hercules, Calif.). The serum 
is then further purified by protein G affinity chromatography to isolate the IgG fraction. Serum from rabbits 
immunized with whole fusion proteins, such as GST and MBP fusion proteins, are purified by depletion of 
antibodies reactive to the fusion partner sequence by passage over an affinity column containing the fusion partner 
either alone or in the context of an irrelevant fusion protein. Sera from other His-tagged antigens and peptide 

20 immunized rabbits as well as fusion partner depleted sera are affinity purified by passage over a column matrix 
composed of the original protein immunogen or free peptide. 

Both crude and affinity purified polyclonal antibodies are further tested by various immunoassays against 
both recombinant cells and cells and tissues that endogenously express 121P1F1. To generate recombinant 
121P1F1 cells, the full-length 121P1F1 cDNA is cloned into pCDNA 3.1 Myc-His expression vector (Invitrogen, 

25 see the Example entitled "Production of Recombinant 121P1F1 in Eukaryotic Systems"). After transfection of the 
construct into 293T cells, cell lysates were probed with the anti-121PlFl polyclonal antibody (Figure 13) and with 
anti-His antibody (Santa Cruz Biotechnologies, Santa Cruz, CA) (Figure 14) demonstrating specific reactivity to 
denatured 121P1F1 protein using the Western blot technique. The polyclonal antibody was also used to test a panel 
of tumor cell lines by Western analysis, for which the results are also shown in Figure 13. The polyclonal antibody 

30 shows strong reactivity to MYC-HIS tagged 121P1F1 in transfected 293T cells and also to several proteins in the 
tumor cell lines, indicating reactivity to endogenous 121P1F1 and to variant molecules of different molecular 
weights. In addition, immunoprecipitation, fluorescent microscopy, immunohistochemistry, and flow cytometric 
techniques on recombinant cells and patient tissues samples are used to characterize 121 PI Fl protein expression 
using the polyclonal antibody. 

35 

Example 10: Generation of 121P1F1 Monoclonal Antibodies (mAbs) 

In one embodiment, therapeutic mAbs to 121 PI Fl comprise those that react with epitopes of the protein 
that would disrupt or modulate the biological function of 121 PI Fl, for example those that would disrupt its 
40 interaction with ligands, proteins, or substrates that mediate its biological activity. Immunogens for generation of 
such mAbs include those designed to encode or contain the entire 121P1F1 protein or its variants or regions of the 
121P1F1 protein or its variants predicted to be antigenic from computer analysis of the amino acid sequence (see, 
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e.g., Figure 5, Figure 6, Figure 7, Figure 8, or Figure 9, and the Example entitled "Antigenicity Profiles"). 
Immunogens include peptides, recombinant bacterial proteins, and mammalian expressed Tag 5 proteins and human 
and murine IgG FC fusion proteins. In addition, cells expressing high levels of 121P1F1, such as 293T-121P1F1 or 
300.19-12 1P1F1 murine Pre-B cells, are used to immunize mice. 
5 To generate mAbs to 121P1F1, mice are first immunized intraperitoneal ly (IP) with, typically, 10-50 ug of 

protein immunogen or 10 7 121 PI Fl -expressing cells mixed in complete Freund's adjuvant. Mice are then 
subsequently immunized IP every 2-4 weeks with, typically, 10-50 ug of protein immunogen or 10 7 cells mixed in 
incomplete Freund's adjuvant. Alternatively, MPL-TDM adjuvant is used in immunizations. In addition to the 
above protein and cell-based immunization strategies, a DNA-based immunization protocol is employed in which a 

10 mammalian expression vector encoding 121P1F1 sequence is used to immunize mice by direct injection of the 
plasmid DNA. For example, the entire coding sequence of 121P1F1, amino acids 1-205, is cloned into the Tag5 
mammalian secretion vector and the recombinant vector is used as immunogen. In another example the same 
amino acids are cloned into an Fc-fusion secretion vector in which the 121P1F1 sequence is fused at the amino- 
terminus to an IgK leader sequence and at the carboxyl-terminus to the coding sequence of the human or murine 

15 IgG Fc region. This recombinant vector is then used as immunogen. The plasmid immunization protocols are used 
in combination with purified proteins expressed from the same vector and with cells expressing 121P1F1. In 
another embodiment the GST-fusion cleavage protein described in Example 8 is used as immunogen. 

During the immunization protocol, test bleeds are taken 7-10 days following an injection to monitor titer 
and specificity of the immune response. Once appropriate reactivity and specificity is obtained as determined by 

20 ELISA, Western blotting, immunoprecipitation, fluorescence microscopy, and flow cytometric analyses, fusion and 
hybridoma generation is then carried out with established procedures well known in the art (see, e.g., Antibodies: A 
Laboratory Manual, CSH Press, Eds., Harlow, and Lane (1988)). 

In one embodiment, monoclonal antibodies are derived that distinguish variant la from 12 1 PI Fl and the 
other variants. For example, a Tag5 protein encoding amino acids 93-126 of variant la is produced and purified 

25 from the supernatants of 293T cells transfected with the cognate Tag5 cDNA vector. Balb C mice are initially 

immunized intraperitoneal ly with 25 ug of the Tag5-variant la protein mixed in complete Freund's adjuvant. Mice 
are subsequently immunized every two weeks with 25 ug of the antigen mixed in incomplete Freund's adjuvant for 
a total of three immunizations. ELISA using the Tag5 antigen determines the titer of serum from immunized mice. 
Reactivity and specificity of serum to the full length variant la protein is monitored by Western blotting, 

30 immunoprecipitation and flow cytometry using 293T cells transfected with an expression vector encoding the 

variant la cDNA (see e.g., the Example entitled "Production of Recombinant 121P1F1 in Eukaryotic Systems"). 
Other recombinant variant la-expressing cells or cells endogenous ly expressing variant la are also used. 
Specificity is also determined by lack of reactivity to cells expressing 12 1 PI Fl and the other variants. Mice 
showing the strongest reactivity to variant 1 a are rested and given a final injection of Tag5 antigen in PBS and then 

35 sacrificed four days later. The spleens of the sacrificed mice are harvested and fused to SPO/2 myeloma cells using 
standard procedures (Harlow and Lane, 1988). Supernatants from HAT selected growth wells are screened by 
ELISA, Western blot, immunoprecipitation, fluorescent microscopy, and flow cytometry to identify 121P1F1 
specific antibody-producing clones. Monoclonal antibodies are also raised that distinguish variant lb and variant 2 
from each other, from variants 3 and 4 and from 121P1F1. This is accomplished through immunization with 

40 antigens, such as KLH-coupIed peptides, that encode amino acids specific to variant lb (amino acids 1-6) and 
variant 2 (amino acids 1 18-122). 
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The binding affinity of a 121 PI Fl monoclonal antibody is determined using standard technologies. 
Affinity measurements quantify the strength of antibody to epitope binding and are used to help define which 
121P1F1 monoclonal antibodies preferred for diagnostic or therapeutic use, as appreciated by one of skill in the art. 
The BIAcore system (Uppsala, Sweden) is a preferred method for determining binding affinity. The BIAcore 
5 system uses surface plasmon resonance (SPR, Welford K. 1991, Opt. Quant. Elect. 23:1; Morton and Myszka, 
1998, Methods in Enzymology 295: 268) to monitor biomolecular interactions in real time. BIAcore analysis 
conveniently generates association rate constants, dissociation rate constants, equilibrium dissociation constants, 
and affinity constants. 

10 

Example 11: HLA Class 1 and Class II Binding Assays 

HLA class I and class II binding assays using purified HLA molecules are performed in accordance with 
disclosed protocols (e.g., PCT publications WO 94/20127 and WO 94/03205; Sidney et al. 9 Current Protocols in 
Immunology 18.3.1 (1998); Sidney, et ai, J. Immunol. 154:247 (1995); Sette, et al., Mol. Immunol. 31:813 (1994)). 

1 5 Briefly, purified MHC molecules (5 to 500 nM) are incubated with various unlabeled peptide inhibitors and 1-10 
nM ,25 I-radiolabeled probe peptides as described. Following incubation, MHC-peptide complexes are separated 
from free peptide by gel filtration and the fraction of peptide bound is determined. Typically, in preliminary 
experiments, each MHC preparation is titered in the presence of fixed amounts of radiolabeled peptides to 
determine the concentration of HLA molecules necessary to bind 10-20% of the total radioactivity. All subsequent 

20 inhibition and direct binding assays are performed using these HLA concentrations. 

Since under these conditions [label]<[HLAl and IC 50 >[HLA], the measured IC 50 values are reasonable 
approximations of the true K D values. Peptide inhibitors are typically tested at concentrations ranging from 120 
ug/ml to 1.2 ng/ml, and are tested in two to four completely independent experiments. To allow comparison of the 
data obtained in different experiments, a relative binding figure is calculated for each peptide by dividing the IC 50 of 

25 a positive control for inhibition by the IC 50 for each tested peptide (typically unlabeled versions of the radiolabeled 
probe peptide). For database purposes, and inter-experiment comparisons, relative binding values are compiled. 
These values can subsequently be converted back into IC 50 nM values by dividing the IC 50 nM of the positive 
controls for inhibition by the relative binding of the peptide of interest. This method of data compilation is accurate 
and consistent for comparing peptides that have been tested on different days, or with different lots of purified 

30 MHC. 

Binding assays as outlined above may be used to analyze HLA supermotif and/or HLA motif-bearing 

peptides. 

Example 12: Identification of HLA Supermotif- and Motif-Bearing CTL Candidate Epitopes 
35 HLA vaccine compositions of the invention can include multiple epitopes. The multiple epitopes can 

comprise multiple HLA supermotifs or motifs to achieve broad population coverage. This example illustrates the 
identification and confirmation of supermotif- and motif-bearing epitopes for the inclusion in such a vaccine 
composition. Calculation of population coverage is performed using the strategy described below. 

Computer searches and algorithms for identification of supermotif and/or motif- bearing epitopes 
40 The searches performed to identify the motif-bearing peptide sequences in the Example entitled 

"Antigenicity Profiles" and Tables V-XVII1, XXVI, and XXVII employ the protein sequence data from the gene 
product of 121 PI Fl set forth in Figures 2 and 3. 
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Computer searches for epitopes bearing HLA Class I or Class II supermotifs or motifs are performed as 
follows. All translated 121P1F1 protein sequences are analyzed using a text string search software program to 
identify potential peptide sequences containing appropriate HLA binding motifs; such programs are readily 
produced in accordance with information in the art in view of known motif/supermotif disclosures. Furthermore, 
5 such calculations can be made mentally. 

Identified A2-, A3-, and DR-supermotif sequences are scored using polynomial algorithms to predict their 
capacity to bind to specific HLA-Class I or Class II molecules. These polynomial algorithms account for the impact 
of different amino acids at different positions, and are essentially based on the premise that the overall affinity (or 
AG) of peptide-HLA molecule interactions can be approximated as a linear polynomial function of the type: 

1 0 "AG" = a„ x a 2 , x a 3/ x a„ y 

where a,, is a coefficient which represents the effect of the presence of a given amino acid (J) at a given position (i) 
along the sequence of a peptide of n amino acids. The crucial assumption of this method is that the effects at each 
position are essentially independent of each other (i.e., independent binding of individual side-chains). When 
residue j occurs at position / in the peptide, it is assumed to contribute a constant amount j, to the free energy of 

1 5 binding of the peptide irrespective of the sequence of the rest of the peptide. 

The method of derivation of specific algorithm coefficients has been described in Gulukota et al. 9 J. Mol. 
Biol. 267:1258-126, 1997; (see also Sidney et aL, Human Immunol. 45:79-93, 1996; and Southwood et al., J. 
Immunol. 160:3363-3373, 1998). Briefly, for all / positions, anchor and non-anchor alike, the geometric mean of 
the average relative binding (ARB) of all peptides carrying./ is calculated relative to the remainder of the group, and 

20 used as the estimate of/,. For Class II peptides, if multiple alignments are possible, only the highest scoring 

alignment is utilized, following an iterative procedure. To calculate an algorithm score of a given peptide in a test 
set, the ARB values corresponding to the sequence of the peptide are multiplied. If this product exceeds a chosen 
threshold, the peptide is predicted to bind. Appropriate thresholds are chosen as a function of the degree of 
stringency of prediction desired. 

25 Selection of HLA-A2 supertvpe cross-reactive peptides 

Protein sequences from 121P1F1 are scanned utilizing motif identification software, to identify 8-, 9- 10- 
and 1 1-mer sequences containing the HLA-A2-supermotif main anchor specificity. Typically, these sequences are 
then scored using the protocol described above and the peptides corresponding to the positive-scoring sequences are 
synthesized and tested for their capacity to bind purified HLA-A*0201 molecules in vitro (HLA-A*0201 is 

30 considered a prototype A2 supertype molecule). 

These peptides are then tested for the capacity to bind to additional A2-supertype molecules (A*0202, 
A*0203, A*0206, and A*6802). Peptides that bind to at least three of the five A2-supertype alleles tested are 
typically deemed A2-supertype cross-reactive binders. Preferred peptides bind at an affinity equal to or less than 
500 nM to three or more HLA-A2 supertype molecules. 

35 Selection of HLA-A3 supermotif-bearing epitopes 

The 121P1F1 protein sequence(s) scanned above is also examined for the presence of peptides with the 
HLA- A3-supermotif primary anchors. Peptides corresponding to the HLA A3 supermotif-bearing sequences are 
then synthesized and tested for binding to HLA-A*0301 and HLA-A* 1101 molecules, the molecules encoded by 
the two most prevalent A3-supertype alleles. The peptides that bind at least one of the two alleles with binding 

40 affinities of <500 nM, often < 200 nM, are then tested for binding cross-reactivity to the other common A3- 

supertype alleles (e.g., A*3101, A*3301, and A*6801) to identify those that can bind at least three of the five HLA- 
A3-supertype molecules tested. 
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Selection of HLA-B7 supermotif bearing epitopes 

The 121P1F1 protein(s) scanned above is also analyzed for the presence of 8-, 9- 10-, or 1 1-mer peptides 
with the HLA-B7-supermotif. Corresponding peptides are synthesized and tested for binding to HLA-B*0702, the 
molecule encoded by the most common B7-supertype allele (i.e., the prototype B7 supertype allele). Peptides 
5 binding B*0702 with 1C 50 of <500 nM are identified using standard methods. These peptides are then tested for 
binding to other common B7-supertype molecules (e.g., B*3501, B*5101, B*5301, and B*5401). Peptides capable 
of binding to three or more of the five B7-supertype alleles tested are thereby identified. 

Selection of Al and A24 motif-bearing epitopes 

To further increase population coverage, HLA-A1 and -A24 epitopes can also be incorporated into vaccine 
10 compositions. An analysis of the 12 1 PI Fl protein can also be performed to identify HLA-A1- and A24-motif- 
containing sequences. 

High affinity and/or cross-reactive binding epitopes that bear other motif and/or supermotifs are identified 
using analogous methodology. 

15 Example 13: Confirmation of Immunogenicitv 

Cross-reactive candidate CTL A2-supermotif-bearing peptides that are identified as described herein are 
selected to confirm in vitro immunogenicity. Confirmation is performed using the following methodology: 
Target Cell Lines for Cellular Screening: 

The .221 A2.1 cell line, produced by transferring the HLA-A2.1 gene into the HLA-A, -B, -C null mutant 

20 human B-lymphoblastoid cell line 72 1 .22 1 , is used as the peptide-loaded target to measure activity of HLA-A2. 1 - 
restricted CTL. This cell line is grown in RPMI-1640 medium supplemented with antibiotics, sodium pyruvate, 
nonessential amino acids and 10% (v/v) heat inactivated FCS. Cells that express an antigen of interest, or 
transfectants comprising the gene encoding the antigen of interest, can be used as target cells to confirm the ability 
of peptide-specific CTLs to recognize endogenous antigen. 

25 Primary CTL Induction Cultures: 

Generation of Dendritic Cells (DC): PBMCs are thawed in RPMI with 30 Kig/ml DNAse, washed twice 
and resuspended in complete medium (RPMI-1640 plus 5% AB human serum, non-essential amino acids, sodium 
pyruvate, L-glutamine and penicillin/streptomycin). The monocytes are purified by plating 10 x 10 6 PBMC/well in 
a 6-well plate. After 2 hours at 37°C, the non-adherent cells are removed by gently shaking the plates and 

30 aspirating the supernatants. The wells are washed a total of three times with 3 ml RPMI to remove most of the non- 
adherent and loosely adherent cells. Three ml of complete medium containing 50 ng/ml of GM-CSF and 1,000 U/ml 
of IL-4 are then added to each well. TNFa is added to the DCs on day 6 at 75 ng/ml and the cells are used for CTL 
induction cultures on day 7. 

Induction of CTL with DC and Peptide: CD8+ T-cells are isolated by positive selection with Dynal 

35 immunomagnetic beads (Dynabeads® M-450) and the detacha-bead® reagent. Typically about 200-250x 1 0 6 

PBMC are processed to obtain 24x1 0 6 CD8 + T-cells (enough for a 48-well plate culture). Briefly, the PBMCs are 
thawed in RPMI with 30ug/ml DNAse, washed once with PBS containing 1% human AB serum and resuspended in 
PBS/1% AB serum at a concentration of 20x1 0 6 cel Is/ml. The magnetic beads are washed 3 times with PBS/AB 
serum, added to the cells (140ul beads/20xl0 6 cells) and incubated for 1 hour at 4°C with continuous mixing. The 

40 beads and cells are washed 4x with PBS/AB serum to remove the nonadherent cells and resuspended at lOOxlO 6 
cells/ml (based on the original cell number) in PBS/AB serum containing lOOnl/ml detacha-bead® reagent and 30 
Hg/ml DNAse. The mixture is incubated for 1 hour at room temperature with continuous mixing. The beads are 
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washed again with PBS/AB/DNAse to collect the CD8+ T-cells. The DC are collected and centrifuged at 1300 rpm 
for 5-7 minutes, washed once with PBS with 1% BSA, counted and pulsed with 40ug/ml of peptide at a cell 
concentration of l-2xl0 6 /ml in the presence of 3ug/ml B 2 - microglobulin for 4 hours at 20°C. The DC are then 
irradiated (4,200 rads), washed 1 time with medium and counted again. 
5 Setting up induction cultures: 0.25 ml cytokine-generated DC (at 1x10 s cells/ml) are co-cultured with 

0.25ml of CD8+ T-cells (at 2xl0 6 cell/ml) in each well of a 48-well plate in the presence of 10 ng/ml of IL-7. 
Recombinant human 1L-10 is added the next day at a final concentration of 10 ng/ml and rhuman 1L-2 is added 48 
hours later at 10 IU/ml. 

Restimulation of the induction cultures with peptide-pulsed adherent cells: Seven and fourteen days after 
10 the primary induction, the cells are restimulated with peptide-pulsed adherent cells. The PBMCs are thawed and 
washed twice with RPMI and DNAse. The cells are resuspended at 5xl0 6 cells/ml and irradiated at -4200 rads. 
The PBMCs are plated at 2xl0 6 in 0.5 ml complete medium per well and incubated for 2 hours at 37°C. The plates 
are washed twice with RPMI by tapping the plate gently to remove the nonadherent cells and the adherent cells 
pulsed with lOug/ml of peptide in the presence of 3 ug/ml B 2 microglobulin in 0.25ml RPMI/5%AB per well for 2 
1 5 hours at 37°C. Peptide solution from each well is aspirated and the wells are washed once with RPMI. Most of the 
media is aspirated from the induction cultures (CD8+ cells) and brought to 0.5 ml with fresh media. The cells are 
then transferred to the wells containing the peptide-pulsed adherent cells. Twenty four hours later recombinant 
human IL-10 is added at a final concentration of 10 ng/ml and recombinant human IL2 is added the next day and 
again 2-3 days later at 50IU/ml (Tsai et aL, Critical Reviews in Immunology 1 8(l-2):65-75, 1998). Seven days 
20 later, the cultures are assayed for CTL activity in a 51 Cr release assay. In some experiments the cultures are assayed 
for peptide-specific recognition in the in situ IFNy ELISA at the time of the second restimulation followed by assay 
of endogenous recognition 7 days later. After expansion, activity is measured in both assays for a side-by-side 
comparison. 

Measurement of CTL lytic activity bv 5! Cr release. 

25 Seven days after the second restimulation, cytotoxicity is determined in a standard (5 hr) 5l Cr release assay 

by assaying individual wells at a single E:T. Peptide-pulsed targets are prepared by incubating the cells with 
lOug/ml peptide overnight at 37°C. 

Adherent target cells are removed from culture flasks with trypsin-EDTA. Target cells are labeled with 
200uCi of 51 Cr sodium chromate (Dupont, Wilmington, DE) for 1 hour at 37°C. Labeled target cells are 

30 resuspended at 10 6 per ml and diluted 1:10 with K562 cells at a concentration of 3.3xl0 6 /ml (an NK-sensitive 

erythroblastoma cell line used to reduce non-specific lysis). Target cells (100 ul) and effectors (lOOul) are plated in 
96 well round-bottom plates and incubated for 5 hours at 37°C. At that time, 100 ul of supernatant are collected 
from each well and percent lysis is determined according to the formula: , 

[(cpm of the test sample- cpm of the spontaneous 51 Cr release sample)/(cpm of the maximal 5, Cr release sample- 
35 cpm of the spontaneous 5, Cr release sample)] x 100. 

Maximum and spontaneous release are determined by incubating the labeled targets with 1% Triton X-100 
and media alone, respectively. A positive culture is defined as one in which the specific lysis (sample- background) 
is 10% or higher in the case of individual wells and is 15% or more at the two highest E:T ratios when expanded 
cultures are assayed. 

40 In situ Measurement of Human IFNv Production as an Indicator of Pep tide-soecific and Endogenous 

Recognition 
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Immulon 2 plates are coated with mouse anti-human IFNy monoclonal antibody (4 ug/ml 0. 1M NaHC0 3 , 
pH8.2) overnight at 4°C. The plates are washed with Ca 2+ , Mg 2+ -free PBS/0.05% Tween 20 and blocked with 
PBS/10% FCS for two hours, after which the CTLs (100 ul/well) and targets (100 ul/well) are added to each well, 
leaving empty wells for the standards and blanks (which received media only). The target cells, either peptide- 
pulsed or endogenous targets, are used at a concentration of lxlO 6 cells/ml. The plates are incubated for 48 hours at 
37°C with 5% C0 2 . 

Recombinant human IFN-gamma is added to the standard wells starting at 400 pg or 1200pg/100 
microliter/well and the plate incubated for two hours at 37°C. The plates are washed and 100 |il of biotinylated 
mouse anti-human IFN-gamma monoclonal antibody (2 microgram/ml in PBS/3%FCS/0.05% Tween 20) are added 
and incubated for 2 hours at room temperature. After washing again, 100 microliter HRP-streptavidin (1 :4000) are 
added and the plates incubated for one hour at room temperature. The plates are then washed 6x with wash buffer, 
100 microliter/well developing solution (TMB 1:1) are added, and the plates allowed to develop for 5-15 minutes. 
The reaction is stopped with 50 microliter/well 1M H 3 P0 4 and read at OD450. A culture is considered positive if it 
measured at least 50 pg of 1 FN -gam ma/we 11 above background and is twice the background level of expression. 

CTL Expansion. 

Those cultures that demonstrate specific lytic activity against peptide-pulsed targets and/or tumor targets 
are expanded over a two week period with anti-CD3. Briefly, 5xl0 4 CD8+ cells are added to a T25 flask containing 
the following: lxlO 6 irradiated (4,200 rad) PBMC (autologous or allogeneic) per ml, 2xl0 5 irradiated (8,000 rad) 
EBV- transformed cells per ml, and OKT3 (anti-CD3) at 30ng per ml in RPMI-1640 containing 10% (v/v) human 
AB serum, non-essential amino acids, sodium pyruvate, 25uM 2-mercaptoethanol, L-glutamine and 
penicillin/streptomycin. Recombinant human 1L2 is added 24 hours later at a final concentration of 200IU/ml and 
every three days thereafter with fresh media at 501U/ml. The cells are split if the cell concentration exceeds 
lxl0 6 /ml and the cultures are assayed between days 13 and 15 at E:T ratios of 30, 10, 3 and 1:1 in the 5l Cr release 
assay or at lxl0 6 /ml in the in situ IFNy assay using the same targets as before the expansion. 

Cultures are expanded in the absence of anti-CD3 + as follows. Those cultures that demonstrate specific 
lytic activity against peptide and endogenous targets are selected and 5xl0 4 CD8 + cells are added to a T25 flask 
containing the following: lxlO 6 autologous PBMC per ml which have been peptide-pulsed with 10 ug/ml peptide 
for two hours at 37°C and irradiated (4,200 rad); 2xl0 5 irradiated (8,000 rad) EBV-transformed cells per ml RPMI- 
1640 containing 10%(v/v) human AB serum, non-essential AA, sodium pyruvate, 25mM 2-ME, L-glutamine and 
gentamicin. 

Immunogenicitv of A2 supermotif-bearing peptides 

A2-supermotif cross-reactive binding peptides are tested in the cellular assay for the ability to induce 
peptide-specific CTL in normal individuals. In this analysis, a peptide is typically considered to be an epitope if it 
induces peptide-specific CTLs in at least individuals, and preferably, also recognizes the endogenously expressed 
peptide. 

Immunogenicity can also be confirmed using PBMCs isolated from patients bearing a tumor that expresses 
121P1F1. Briefly, PBMCs are isolated from patients, re-stimulated with peptide-pulsed monocytes and assayed for 
the ability to recognize peptide-pulsed target cells as well as transfected cells endogenously expressing the antigen. 

Evaluation of A*03/A1 1 immunogenicitv 

HLA-A3 supermotif-bearing cross-reactive binding peptides are also evaluated for immunogenicity using 
methodology analogous for that used to evaluate the immunogenicity of the HLA-A2 supermotif peptides. 



79 



511582003440 



Evaluation of B7 immunogenicitv 

Immunogenicity screening of the B7-supertype cross-reactive binding peptides identified as set forth 
herein are confirmed in a manner analogous to the confirmation of A2-and A3 -supermotif-bearing peptides. 

Peptides bearing other supermotifs/motifs, e.g., HLA-A1, HLA-A24 etc. are also confirmed using similar 
5 methodology 

Example 14: Implementation of the Extended Supermotif to Improve the Binding Capacity of 
Native Epitopes by Creating Analogs 

HLA motifs and supermotifs (comprising primary and/or secondary residues) are useful in the 
10 identification and preparation of highly cross-reactive native peptides, as demonstrated herein. Moreover, the 

definition of HLA motifs and supermotifs also allows one to engineer highly cross-reactive epitopes by identifying 
residues within a native peptide sequence which can be analoged to confer upon the peptide certain characteristics, 
e.g. greater cross-reactivity within the group of HLA molecules that comprise a supertype, and/or greater binding 
affinity for some or all of those HLA molecules. Examples of analoging peptides to exhibit modulated binding 
1 5 affinity are set forth in this example. 

Analoging at Primary Anchor Residues 

Peptide engineering strategies are implemented to further increase the cross-reactivity of the epitopes. For 
example, the main anchors of A2-supermotif-bearing peptides are altered, for example, to introduce a preferred L, I, 
V, or M at position 2, and I or V at the C-terminus. 
20 To analyze the cross-reactivity of the analog peptides, each engineered analog is initially tested for binding 

to the prototype A2 supertype allele A*0201, then, if A*0201 binding capacity is maintained, for A2-supertype 
cross-reactivity. 

Alternatively, a peptide is confirmed as binding one or all supertype members and then analoged to 
modulate binding affinity to any one (or more) of the supertype members to add population coverage. 
25 The selection of analogs for immunogenicity in a cellular screening analysis is typically further restricted 

by the capacity of the parent wild type (WT) peptide to bind at least weakly, i.e., bind at an IC 50 of 5000nM or less, 
to three of more A2 supertype alleles. The rationale for this requirement is that the WT peptides must be present 
endogenously in sufficient quantity to be biologically relevant. Analoged peptides have been shown to have 
increased immunogenicity and cross-reactivity by T cells specific for the parent epitope (see, e.g., Parkhurst et ai, 
30 J. Immunol. 157:2539, 1996; and Pogue et ai, Proc. Natl. Acad. Set. USA 92:8166, 1995). 

In the cellular screening of these peptide analogs, it is important to confirm that analog-specific CTLs are 
also able to recognize the wild-type peptide and, when possible, target cells that endogenously express the epitope. 
Analoging of HLA- A3 and B7-supermotif-bearing peptides 

Analogs of HLA-A3 supermotif-bearing epitopes are generated using strategies similar to those employed 
35 in analoging HLA-A2 supermotif-bearing peptides. For example, peptides binding to 3/5 of the A3-supertype 
molecules are engineered at primary anchor residues to possess a preferred residue (V, S, M, or A) at position 2. 

The analog peptides are then tested for the ability to bind A*03 and A* 1 1 (prototype A3 supertype alleles). 
Those peptides that demonstrate < 500 nM binding capacity are then confirmed as having A3-supertype cross- 
reactivity. 

40 Similarly to the A2- and A3- motif bearing peptides, peptides binding 3 or more B7-supertype alleles can 

be improved, where possible, to achieve increased cross-reactive binding or greater binding affinity or binding half 
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life. B7 supermotif-bearing peptides are, for example, engineered to possess a preferred residue (V, I, L, or F) at 
the C-terminal primary anchor position, as demonstrated by Sidney et al. (J. Immunol. 157:3480-3490, 1996). 

Analoging at primary anchor residues of other motif and/or supermotif-bearing epitopes is performed in a 
like manner. 

5 The analog peptides are then be confirmed for immunogenicity, typically in a cellular screening assay. 

Again, it is generally important to demonstrate that analog-specific CTLs are also able to recognize the wild-type 
peptide and, when possible, targets that endogenously express the epitope. 
Analoging at Secondary Anchor Residues 

Moreover, HLA supermotifs are of value in engineering highly cross-reactive peptides and/or peptides that 
10 bind HLA molecules with increased affinity by identifying particular residues at secondary anchor positions that are 
associated with such properties. For example, the binding capacity of a B7 supermotif-bearing peptide with an F 
residue at position 1 is analyzed. The peptide is then analoged to, for example, substitute L for F at position 1 . The 
analoged peptide is evaluated for increased binding affinity, binding half life and/or increased cross-reactivity. 
Such a procedure identifies analoged peptides with enhanced properties. 
15 Engineered analogs with sufficiently improved binding capacity or cross-reactivity can also be tested for 

immunogenicity in HLA-B7-transgenic mice, following for example, IFA immunization or lipopeptide 
immunization. Analoged peptides are additionally tested for the ability to stimulate a recall response using PBMC 
from patients with 1 2 1 P 1 F 1 -expressing tumors. 
Other analoging strategies 

20 Another form of peptide analoging, unrelated to anchor positions, involves the substitution of a cysteine 

with a-amino butyric acid. Due to its chemical nature, cysteine has the propensity to form disulfide bridges and 
sufficiently alter the peptide structurally so as to reduce binding capacity. Substitution of a-amino butyric acid for 
cysteine not only alleviates this problem, but has been shown to improve binding and crossbinding capabilities in 
some instances (see, e.g., the review by Sette et al., In: Persistent Viral Infections, Eds. R. Ahmed and I. Chen, John 

25 Wiley & Sons, England, 1999). 

Thus, by the use of single amino acid substitutions, the binding properties and/or cross-reactivity of 
peptide ligands for HLA supertype molecules can be modulated. 

Example 15: Identification and confirmation of 121PlFl-derived sequences with HLA-DR binding 

30 motifs 

Peptide epitopes bearing an HLA class II supermotif or motif are identified and confirmed as outlined 
below using methodology similar to that described for HLA Class I peptides. 
Selection of HLA-DR-supermotif-bearing epitopes. 

To identify 121 PI Fl -derived, HLA class II HTL epitopes, a 121P1F1 antigen is analyzed for the presence 
35 of sequences bearing an HLA-DR-motif or supermotif. Specifically, 15-mer sequences are selected comprising a 
DR-supermotif, comprising a 9-mer core, and three-residue N- and C-terminal flanking regions (15 amino acids 
total). 

Protocols for predicting peptide binding to DR molecules have been developed (Southwood et al., J. 
Immunol. 160:3363-3373, 1998). These protocols, specific for individual DR molecules, allow the scoring, and 
40 ranking, of 9-mer core regions. Each protocol not only scores peptide sequences for the presence of DR-supermotif 
primary anchors (i.e., at position 1 and position 6) within a 9-mer core, but additionally evaluates sequences for the 
presence of secondary anchors. Using allele-specific selection tables (see, e.g., Southwood et al., ibid), it has been 
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found that these protocols efficiently select peptide sequences with a high probability of binding a particular DR 
molecule. Additionally, it has been found that performing these protocols in tandem, specifically those for DR1, 
DR4w4, and DR7, can efficiently select DR cross-reactive peptides. 

The 121 PI Fl -derived peptides identified above are tested for their binding capacity for various common 
5 HLA-DR molecules. All peptides are initially tested for binding to the DR molecules in the primary panel: DR1, 
DR4w4, and DR7. Peptides binding at least two of these three DR molecules are then tested for binding to DR2w2 
pi, DR2w2 (32, DR6wl9, and DR9 molecules in secondary assays. Finally, peptides binding at least two of the 
four secondary panel DR molecules, and thus cumulatively at least four of seven different DR molecules, are 
screened for binding to DR4wl5, DR5wll, and DR8w2 molecules in tertiary assays. Peptides binding at least 
10 seven of the ten DR molecules comprising the primary, secondary, and tertiary screening assays are considered 
cross-reactive DR binders. 12 1 PI Fl -derived peptides found to bind common HLA-DR alleles are of particular 
interest. 

Selection of DR3 motif peptides 

Because HLA-DR3 is an allele that is prevalent in Caucasian, Black, and Hispanic populations, DR3 
1 5 binding capacity is a relevant criterion in the selection of HTL epitopes. Thus, peptides shown to be candidates 

may also be assayed for their DR3 binding capacity. However, in view of the binding specificity of the DR3 motif, 
peptides binding only to DR3 can also be considered as candidates for inclusion in a vaccine formulation. 

To efficiently identify peptides that bind DR3, target 121P1F1 antigens are analyzed for sequences 
carrying one of the two DR3-specific binding motifs reported by Geluk et ai (J. Immunol 152:5742-5748, 1994). 
20 The corresponding peptides are then synthesized and confirmed as having the ability to bind DR3 with an affinity of 
luM or better, i.e., less than 1 uM. Peptides are found that meet this binding criterion and qualify as HLA class II 
high affinity binders. 

DR3 binding epitopes identified in this manner are included in vaccine compositions with DR supermotif- 
bearing peptide epitopes. 

25 Similarly to the case of HLA class I motif-bearing peptides, the class II motif-bearing peptides are 

analoged to improve affinity or cross-reactivity. For example, aspartic acid at position 4 of the 9-mer core sequence 
is an optimal residue for DR3 binding, and substitution for that residue often improves DR 3 binding. 

Example 16: Immunogenicitv of 121PlFl-derived HTL epitopes 
30 This example determines immunogenic DR supermotif- and DR3 motif-bearing epitopes among those 

identified using the methodology set forth herein. 

Immunogenicity of HTL epitopes are confirmed in a manner analogous to the determination of 
immunogenicity of CTL epitopes, by assessing the ability to stimulate HTL responses and/or by using appropriate 
transgenic mouse models. Immunogenicity is determined by screening for: 1 .) in vitro primary induction using 
35 normal PBMC or 2.) recall responses from patients who have 121PlFl-expressing tumors. 

Example 17: Calculation of phenotvpic frequencies of HLA-suoertypes in various ethnic 
backgrounds to determine breadth of population coverage 

This example illustrates the assessment of the breadth of population coverage of a vaccine composition 
40 comprised of multiple epitopes comprising multiple supermotifs and/or motifs. 

In order to analyze population coverage, gene frequencies of HLA alleles are determined. Gene 
frequencies for each HLA allele are calculated from antigen or allele frequencies utilizing the binomial distribution 
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formulae gf=l-(SQRT(l-af)) (see, e.g., Sidney et aL, Human Immunol. 45:79-93, 1996). To obtain overall 
phenotypic frequencies, cumulative gene frequencies are calculated, and the cumulative antigen frequencies derived 
by the use of the inverse formula [af=l-(l-Cgf) 2 ]. 

Where frequency data is not available at the level of DNA typing, correspondence to the serologically 
5 defined antigen frequencies is assumed. To obtain total potential supertype population coverage no linkage 

disequilibrium is assumed, and only alleles confirmed to belong to each of the supertypes are included (minimal 
estimates). Estimates of total potential coverage achieved by inter-loci combinations are made by adding to the A 
coverage the proportion of the non-A covered population that could be expected to be covered by the B alleles 
considered (e.g., total=A+B*(l-A)). Confirmed members of the A3-like supertype are A3, A 1 1, A3 1, A*3301, and 

10 A*6801. Although the A3-like supertype may also include A34, A66, and A*7401, these alleles were not included 
in overall frequency calculations. Likewise, confirmed members of the A2-like supertype family are A*0201, 
A*0202, A*0203, A*0204, A*0205, A*0206, A*0207, A*6802, and A*6901. Finally, the B7-like supertype- 
confirmed alleles are: B7, B*3501-03, B51, B*5301, B*5401, B*5501-2, B*5601, B*6701, and B*7801 
(potentially also B*1401, B*3504-06, B*4201, and B*5602). 

15 Population coverage achieved by combining the A2-, A3- and B7-supertypes is approximately 86% in five 

major ethnic groups. Coverage may be extended by including peptides bearing the Al and A24 motifs. On 
average, Al is present in 12% and A24 in 29% of the population across five different major ethnic groups 
(Caucasian, North American Black, Chinese, Japanese, and Hispanic). Together, these alleles are represented with 
an average frequency of 39% in these same ethnic populations. The total coverage across the major ethnicities 

20 when Al and A24 are combined with the coverage of the A2-, A3- and B7-supertype alleles is >95%. An 

analogous approach can be used to estimate population coverage achieved with combinations of class II motif- 
bearing epitopes. 

Immunogenicity studies in humans (e.g., Bertoni et al., J. Clin. Invest. 100:503, 1997; Doolan et al., 
Immunity 7:97, 1997; and Threlkeld et al, J. Immunol. 159: 1648, 1997) have shown that highly cross-reactive 
25 binding peptides are almost always recognized as epitopes. The use of highly cross-reactive binding peptides is an 
important selection criterion in identifying candidate epitopes for inclusion in a vaccine that is immunogenic in a 
diverse population. 

With a sufficient number of epitopes (as disclosed herein and from the art), an average population 
coverage is predicted to be greater than 95% in each of five major ethnic populations. The game theory Monte 
30 Carlo simulation analysis, which is known in the art (see e.g., Osborne, M.J. and Rubinstein, A. "A course in game 
theory" MIT Press, 1994), can be used to estimate what percentage of the individuals in a population comprised of 
the Caucasian, North American Black, Japanese, Chinese, and Hispanic ethnic groups would recognize the vaccine 
epitopes described herein. A preferred percentage is 90%. A more preferred percentage is 95%. 

35 Example 18: CTL Recognition Of Endogenouslv Processed Antigens A fter Priming 

This example confirms that CTL induced by native or analoged peptide epitopes identified and selected as 
described herein recognize endogenously synthesized, i.e., native antigens. 

Effector cells isolated from transgenic mice that are immunized with peptide epitopes, for example HLA- 
A2 supermotif-bearing epitopes, are re-stimulated in vitro using peptide-coated stimulator cells. Six days later, 
40 effector cells are assayed for cytotoxicity and the cell lines that contain peptide-specific cytotoxic activity are 

further re-stimulated. An additional six days later, these cell lines are tested for cytotoxic activity on 51 Cr labeled 
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Jurkat-A2.1/K b target cells in the absence or presence of peptide, and also tested on 51 Cr labeled target cells bearing 
the endogenously synthesized antigen, i.e. cells that are stably transfected with 121P1F1 expression vectors. 

The results demonstrate that CTL lines obtained from animals primed with peptide epitope recognize 
endogenously synthesized 121P1F1 antigen. The choice of transgenic mouse model to be used for such an analysis 
5 depends upon the epitope(s) that are being evaluated. In addition to HLA-A*0201/K b transgenic mice, several other 
transgenic mouse models including mice with human All, which may also be used to evaluate A3 epitopes, and B7 
alleles have been characterized and others (e.g., transgenic mice for HLA-A1 and A24) are being developed. HLA- 
DR1 and HLA-DR3 mouse models have also been developed, which may be used to evaluate HTL epitopes. 

10 Example 19: Activity Of CTL-HTL Conjugated Epitopes In Transgenic Mice 

This example illustrates the induction of CTLs and HTLs in transgenic mice, by use of a 121PlFl-derived 
CTL and HTL peptide vaccine compositions. The vaccine composition used herein comprise peptides to be 
administered to a patient with a 12 1 PI Fl -expressing tumor. The peptide composition can comprise multiple CTL 
and/or HTL epitopes. The epitopes are identified using methodology as described herein. This example also 
1 5 illustrates that enhanced immunogenicity can be achieved by inclusion of one or more HTL epitopes in a CTL 

vaccine composition; such a peptide composition can comprise an HTL epitope conjugated to a CTL epitope. The 
CTL epitope can be one that binds to multiple HLA family members at an affinity of 500 nM or less, or analogs of 
that epitope. The peptides may be lipidated, if desired. 

Immunization procedures: Immunization of transgenic mice is performed as described (Alexander et al., 
20 J. Immunol. 159:4753-4761, 1997). For example, A2/K b mice, which are transgenic for the human HLA A2.1 allele 
and are used to confirm the immunogenicity of HLA-A*0201 motif- or HLA-A2 supermotif-bearing epitopes, and 
are primed subcutaneously (base of the tail) with a 0.1 ml of peptide in Incomplete Freund's Adjuvant, or if the 
peptide composition is a lipidated CTL/HTL conjugate, in DMSO/saline, or if the peptide composition is a 
polypeptide, in PBS or Incomplete Freund's Adjuvant. Seven days after priming, splenocytes obtained from these 
25 animals are restimulated with syngenic irradiated LPS-activated lymphoblasts coated with peptide. 

Cell lines: Target cells for peptide-specific cytotoxicity assays are Jurkat cells transfected with the HLA- 
A2.1/K b chimeric gene (e.g., Vitiello et al., J. Exp. Med 173:1007, 1991) 

In vitro CTL activation: One week after priming, spleen cells (30x1 0 6 cells/flask) are co-cultured at 37°C 
with syngeneic, irradiated (3000 rads), peptide coated lymphoblasts (10x1 0 6 cells/flask) in 10 ml of culture 
30 medium/T25 flask. After six days, effector cells are harvested and assayed for cytotoxic activity. 

Assay for cytotoxic activity: Target cells (1.0 to 1.5xl0 6 ) are incubated at 37°C in the presence of 200 ul 
of 5, Cr. After 60 minutes, cells are washed three times and resuspended in R10 medium. Peptide is added where 
required at a concentration of 1 ug/ml. For the assay, 10 4 5, Cr-labeIed target cells are added to different 
concentrations of effector cells (final volume of 200 ul) in U-bottom 96-well plates. After a six hour incubation 
35 period at 37°C, a 0. 1 ml aliquot of supernatant is removed from each well and radioactivity is determined in a 

Micromedic automatic gamma counter. The percent specific lysis is determined by the formula: percent specific 
release = 100 x (experimental release - spontaneous release)/(maximum release - spontaneous release). To facilitate 
comparison between separate CTL assays run under the same conditions, % 5, Cr release data is expressed as lytic 
units/10 6 cells. One lytic unit is arbitrarily defined as the number of effector cells required to achieve 30% lysis of 
40 10,000 target cells in a six hour 5l Cr release assay. To obtain specific lytic units/10 6 , the lytic units/10 6 obtained in 
the absence of peptide is subtracted from the lytic units/ 10 6 obtained in the presence of peptide. For example, if 
30% 5, Cr release is obtained at the effector (E): target (T) ratio of 50: 1 (i.e., 5xl0 5 effector cells for 10,000 targets) 
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in the absence of peptide and 5:1 (i.e., 5xl0 4 effector cells for 10,000 targets) in the presence of peptide, the specific 
lytic units would be: [(l/50,000)-(l/500,000)] x 10 6 = 18 LU. 

The results are analyzed to assess the magnitude of the CTL responses of animals injected with the 
immunogenic CTL/HTL conjugate vaccine preparation and are compared to the magnitude of the CTL response 
achieved using, for example, CTL epitopes as outlined above in the Example entitled "Confirmation of 
Immunogenicity". Analyses similar to this may be performed to confirm the immunogenicity of peptide conjugates 
containing multiple CTL epitopes and/or multiple HTL epitopes. In accordance with these procedures, it is found 
that a CTL response is induced, and concomitantly that an HTL response is induced upon administration of such 
compositions. 

Example 20: Selection of CTL and HTL epitopes for inclusion in an 121PlFl-specific vaccine. 

This example illustrates a procedure for selecting peptide epitopes for vaccine compositions of the 
invention. The peptides in the composition can be in the form of a nucleic acid sequence, either single or one or 
more sequences {i.e., minigene) that encodes peptide(s), or can be single and/or polyepitopic peptides. 

The following principles are utilized when selecting a plurality of epitopes for inclusion in a vaccine 
composition. Each of the following principles is balanced in order to make the selection. 

Epitopes are selected which, upon administration, mimic immune responses that are correlated with 
121P1F1 clearance. The number of epitopes used depends on observations of patients who spontaneously clear 
1 2 1 P 1 F 1 . For example, if it has been observed that patients who spontaneously clear 1 2 1 P 1 F 1 generate an immune 
response to at least three (3) from 121P1F1 antigen, then three or four (3-4) epitopes should be included for HLA 
class I. A similar rationale is used to determine HLA class II epitopes. 

Epitopes are often selected that have a binding affinity of an IC 50 of 500 nM or less for an HLA class I 
molecule, or for class II, an IC 50 of 1000 nM or less; or HLA Class I peptides with high binding scores from the 
BIMAS web site, at URL bimas.dcrt.nih.gov/ . 

In order to achieve broad coverage of the vaccine through out a diverse population, sufficient supermotif 
bearing peptides, or a sufficient array of allele-specific motif bearing peptides, are selected to give broad population 
coverage. In one embodiment, epitopes are selected to provide at least 80% population coverage. A Monte Carlo 
analysis, a statistical evaluation known in the art, can be employed to assess breadth, or redundancy, of population 
coverage. 

When creating polyepitopic compositions, or a minigene that encodes same, it is typically desirable to 
generate the smallest peptide possible that encompasses the epitopes of interest. The principles employed are 
similar, if not the same, as those employed when selecting a peptide comprising nested epitopes. For example, a 
protein sequence for the vaccine composition is selected because it has maximal number of epitopes contained 
within the sequence, i.e., it has a high concentration of epitopes. Epitopes may be nested or overlapping (i.e., frame 
shifted relative to one another). For example, with overlapping epitopes, two 9-mer epitopes and one 10-mer 
epitope can be present in a 10 amino acid peptide. Each epitope can be exposed and bound by an HLA molecule 
upon administration of such a peptide. A multi-epitopic, peptide can be generated synthetically, recombinantly, or 
via cleavage from the native source. Alternatively, an analog can be made of this native sequence, whereby one or 
more of the epitopes comprise substitutions that alter the cross-reactivity and/or binding affinity properties of the 
polyepitopic peptide. Such a vaccine composition is administered for therapeutic or prophylactic purposes. This 
embodiment provides for the possibility that an as yet undiscovered aspect of immune system processing will apply 
to the native nested sequence and thereby facilitate the production of therapeutic or prophylactic immune response- 
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inducing vaccine compositions. Additionally such an embodiment provides for the possibility of motif-bearing 
epitopes for an HLA makeup that is presently unknown. Furthermore, this embodiment (absent the creating of any 
analogs) directs the immune response to multiple peptide sequences that are actually present in 121 PI Fl, thus 
avoiding the need to evaluate any junctional epitopes. Lastly, the embodiment provides an economy of scale when 
5 producing nucleic acid vaccine compositions. Related to this embodiment, computer programs can be derived in 
accordance with principles in the art, which identify in a target sequence, the greatest number of epitopes per 
sequence length. 

A vaccine composition comprised of selected peptides, when administered, is safe, efficacious, and elicits 
an immune response similar in magnitude to an immune response that controls or clears cells that bear or 
10 overexpress 121P1F1. 

Example 21: Construction of "Minigene" Multi-Epitope DNA Plasmids 

This example discusses the construction of a minigene expression plasmid. Minigene plasmids may, of 
course, contain various configurations of B cell, CTL and/or HTL epitopes or epitope analogs as described herein. 

1 5 A minigene expression plasmid typically includes multiple CTL and HTL peptide epitopes. In the present 

example, HLA-A2, -A3, -B7 supermotif-bearing peptide epitopes and HLA-A1 and -A24 motif-bearing peptide 
epitopes are used in conjunction with DR supermotif-bearing epitopes and/or DR3 epitopes. HLA class I 
supermotif or motif-bearing peptide epitopes derived 121P1F1, are selected such that multiple supermotifs/motifs 
are represented to ensure broad population coverage. Similarly, HLA class II epitopes are selected from 121P1F1 

20 to provide broad population coverage, i.e. both HLA DR- 1-4-7 supermotif-bearing epitopes and HLA DR-3 motif- 
bearing epitopes are selected for inclusion in the minigene construct. The selected CTL and HTL epitopes are then 
incorporated into a minigene for expression in an expression vector. 

Such a construct may additionally include sequences that direct the HTL epitopes to the endoplasmic 
reticulum. For example, the Ii protein may be fused to one or more HTL epitopes as described in the art, wherein 

25 the CLIP sequence of the Ii protein is removed and replaced with an HLA class II epitope sequence so that HLA 
class II epitope is directed to the endoplasmic reticulum, where the epitope binds to an HLA class II molecules. 

This example illustrates the methods to be used for construction of a minigene-bearing expression plasmid. 
Other expression vectors that may be used for minigene compositions are available and known to those of skill in 
the art. 

30 The minigene DNA plasmid of this example contains a consensus Kozak sequence and a consensus murine 

kappa Ig-light chain signal sequence followed by CTL and/or HTL epitopes selected in accordance with principles 
disclosed herein. The sequence encodes an open reading frame fused to the Myc and His antibody epitope tag 
coded for by the pcDNA 3.1 Myc-His vector. 

Overlapping oligonucleotides that can, for example, average about 70 nucleotides in length with 15 

35 nucleotide overlaps, are synthesized and HPLC-purified. The oligonucleotides encode the selected peptide epitopes 
as well as appropriate linker nucleotides, Kozak sequence, and signal sequence. The final multiepitope minigene is 
assembled by extending the overlapping oligonucleotides in three sets of reactions using PCR. A Perkin/Elmer 
9600 PCR machine is used and a total of 30 cycles are performed using the following conditions: 95°C for 15 sec, 
annealing temperature (5° below the lowest calculated Tm of each primer pair) for 30 sec, and 72°C for I min, 

40 For example, a minigene is prepared as follows. For a first PCR reaction, 5 ug of each of two 

oligonucleotides are annealed and extended: In an example using eight oligonucleotides, i.e., four pairs of primers, 
oligonucleotides 1+2, 3+4, 5+6, and 7+8 are combined in 100 ul reactions containing Pfu polymerase buffer (lx= 
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10 mM KCL, 10 mM (NH4) 2 S0 4 , 20 mM Tris-chloride, pH 8.75, 2 mM MgS0 4 , 0.1% Triton X-100, 100 ug/ml 
BSA), 0.25 mM each dNTP, and 2.5 U of Pfit polymerase. The full-length dimer products are gel-purified, and two 
reactions containing the product of 1+2 and 3+4, and the product of 5+6 and 7+8 are mixed, annealed, and extended 
for 10 cycles. Half of the two reactions are then mixed, and 5 cycles of annealing and extension carried out before 
5 flanking primers are added to amplify the full length product. The full-length product is gel-purified and cloned 
into pCR-blunt (Invitrogen) and individual clones are screened by sequencing. 

Example 22: The Plasmid Construct and the Degree to Which It Induces Immunogenicitv. 

The degree to which a plasmid construct, for example a plasmid constructed in accordance with the 
10 previous Example, is able to induce immunogenicity is confirmed in vitro by determining epitope presentation by 
APC following transduction or transfection of the APC with an epitope-expressing nucleic acid construct. Such a 
study determines "antigenicity" and allows the use of human APC. The assay determines the ability of the epitope 
to be presented by the APC in a context that is recognized by a T cell by quantifying the density of epitope-HLA 
class I complexes on the cell surface. Quantitation can be performed by directly measuring the amount of peptide 
15 eluted from the APC {see, e.g., Sijts et al.,J. Immunol. 156:683-692, 1996; Demotz et ai, Nature 342:682-684, 
1989); or the number of peptide-HLA class I complexes can be estimated by measuring the amount of lysis or 
lymphokine release induced by diseased or transfected target cells, and then determining the concentration of 
peptide necessary to obtain equivalent levels of lysis or lymphokine release (see, e.g., Kageyama et al, J. Immunol. 
154:567-576, 1995). 

20 Alternatively, immunogenicity is confirmed through in vivo injections into mice and subsequent in vitro 

assessment of CTL and HTL activity, which are analyzed using cytotoxicity and proliferation assays, respectively, 

as detailed e.g., in Alexander et ai, Immunity 1:751-761, 1994. 

For example, to confirm the capacity of a DNA minigene construct containing at least one HLA-A2 

supermotif peptide to induce CTLs in vivo, HLA-A2.1/K b transgenic mice, for example, are immunized 
25 intramuscularly with 100 ug of naked cDNA. As a means of comparing the level of CTLs induced by cDNA 

immunization, a control group of animals is also immunized with an actual peptide composition that comprises 

multiple epitopes synthesized as a single polypeptide as they would be encoded by the minigene. 

Splenocytes from immunized animals are stimulated twice with each of the respective compositions 

(peptide epitopes encoded in the minigene or the polyepitopic peptide), then assayed for peptide-specific cytotoxic 
30 activity in a 5l Cr release assay. The results indicate the magnitude of the CTL response directed against the A2- 

restricted epitope, thus indicating the in vivo immunogenicity of the minigene vaccine and polyepitopic vaccine. 

It is, therefore, found that the minigene elicits immune responses directed toward the HLA-A2 supermotif 

peptide epitopes as does the polyepitopic peptide vaccine. A similar analysis is also performed using other HLA- 

A3 and HLA-B7 transgenic mouse models to assess CTL induction by HLA-A3 and HLA-B7 motif or supermotif 
35 epitopes, whereby it is also found that the minigene elicits appropriate immune responses directed toward the 

provided epitopes. 

To confirm the capacity of a class II epitope-encoding minigene to induce HTLs in vivo, DR transgenic 
mice, or for those epitopes that cross react with the appropriate mouse MHC molecule, I-A b -restricted mice, for 
example, are immunized intramuscularly with 100 ug of plasmid DNA. As a means of comparing the level of 
40 HTLs induced by DNA immunization, a group of control animals is also immunized with an actual peptide 

composition emulsified in complete Freund's adjuvant. CD4+ T cells, i.e. HTLs, are purified from splenocytes of 
immunized animals and stimulated with each of the respective compositions (peptides encoded in the minigene). 
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The HTL response is measured using a 3 H-thymidine incorporation proliferation assay, (see, e.g., Alexander et ai. 
Immunity 1:751-761, 1994). The results indicate the magnitude of the HTL response, thus demonstrating the in 
vivo immunogenicity of the minigene. 

DNA minigenes, constructed as described in the previous Example, can also be confirmed as a vaccine in 
5 combination with a boosting agent using a prime boost protocol. The boosting agent can consist of recombinant 
protein (e.g., Barnett et ai, Aids Res. and Human Retroviruses 14, Supplement 3:S299-S309, 1998) or recombinant 
vaccinia, for example, expressing a minigene or DNA encoding the complete protein of interest (see, e.g., Hanke et 
ai, Vaccine 16:439-445, 1998; Sedegah et ai, Proc. Natl. Acad Sci USA 95:7648-53, 1998; Hanke and 
McMichael, Immunol. Letters 66:177-181, 1999; and Robinson et ai, Nature Med. 5:526-34, 1999). 

10 For example, the efficacy of the DNA minigene used in a prime boost protocol is initially evaluated in 

transgenic mice. In this example, A2.1/K b transgenic mice are immunized IM with 100 ug of a DNA minigene 
encoding the immunogenic peptides including at least one HLA-A2 supermotif-bearing peptide. After an 
incubation period (ranging from 3-9 weeks), the mice are boosted IP with 10 7 pfu/mouse of a recombinant vaccinia 
virus expressing the same sequence encoded by the DNA minigene. Control mice are immunized with 100 ug of 

1 5 DNA or recombinant vaccinia without the minigene sequence, or with DNA encoding the minigene, but without the 
vaccinia boost. After an additional incubation period of two weeks, splenocytes from the mice are immediately 
assayed for peptide-specific activity in an ELISPOT assay. Additionally, splenocytes are stimulated in vitro with 
the A2-restricted peptide epitopes encoded in the minigene and recombinant vaccinia, then assayed for peptide- 
specific activity in an alpha, beta and/or gamma I FN ELISA. 

20 It is found that the minigene utilized in a prime-boost protocol elicits greater immune responses toward the 

HLA-A2 supermotif peptides than with DNA alone. Such an analysis can also be performed using HLA-A1 1 or 
HLA-B7 transgenic mouse models to assess CTL induction by HLA-A3 or HLA-B7 motif or supermotif epitopes. 
The use of prime boost protocols in humans is described below in the Example entitled "Induction of CTL 
Responses Using a Prime Boost Protocol " 

25 

Example 23: Peptide Compositions for Prophylactic Uses 

Vaccine compositions of the present invention can be used to prevent 121P1F1 expression in persons who 
are at risk for tumors that bear this antigen. For example, a polyepitopic peptide epitope composition (or a nucleic 
acid comprising the same) containing multiple CTL and HTL epitopes such as those selected in the above 
30 Examples, which are also selected to target greater than 80% of the population, is administered to individuals at risk 
for a 1 2 1 P 1 F 1 -associated tumor. 

For example, a peptide-based composition is provided as a single polypeptide that encompasses multiple 
epitopes. The vaccine is typically administered in a physiological solution that comprises an adjuvant, such as 
Incomplete Freunds Adjuvant. The dose of peptide for the initial immunization is from about 1 to about 50,000 ug, 
35 generally 100-5,000 ug, for a 70 kg patient. The initial administration of vaccine is followed by booster dosages at 
4 weeks followed by evaluation of the magnitude of the immune response in the patient, by techniques that 
determine the presence of epitope-specific CTL populations in a PBMC sample. Additional booster doses are 
administered as required. The composition is found to be both safe and efficacious as a prophylaxis against 
1 2 1 P 1 F 1 -associated disease. 

40 Alternatively, a composition typically comprising transfecting agents is used for the administration of a 

nucleic acid-based vaccine in accordance with methodologies known in the art and disclosed herein. 
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Example 24: Polvepitopic Vaccine Compositions Derived from Native 121P1F1 Sequences 
A native 121P1F1 polyprotein sequence is analyzed, preferably using computer algorithms defined for 
each class I and/or class II supermotif or motif, to identify "relatively short" regions of the polyprotein that 
comprise multiple epitopes. The "relatively short" regions are preferably less in length than an entire native 
5 antigen. This relatively short sequence that contains multiple distinct or overlapping, "nested" epitopes is selected; 
it can be used to generate a minigene construct. The construct is engineered to express the peptide, which 
corresponds to the native protein sequence. The "relatively short" peptide is generally less than 250 amino acids in 
length, often less than 100 amino acids in length, preferably less than 75 amino acids in length, and more preferably 
less than 50 amino acids in length. The protein sequence of the vaccine composition is selected because it has 
10 maximal number of epitopes contained within the sequence, i.e., it has a high concentration of epitopes. As noted 
herein, epitope motifs may be nested or overlapping (i.e., frame shifted relative to one another). For example, with 
overlapping epitopes, two 9-mer epitopes and one 10-mer epitope can be present in a 10 amino acid peptide. Such a 
vaccine composition is administered for therapeutic or prophylactic purposes. 

The vaccine composition will include, for example, multiple CTL epitopes from 12 1 PI Fl antigen and at 
1 5 least one HTL epitope. This polyepitopic native sequence is administered either as a peptide or as a nucleic acid 

sequence which encodes the peptide. Alternatively, an analog can be made of this native sequence, whereby one or 
more of the epitopes comprise substitutions that alter the cross-reactivity and/or binding affinity properties of the 
polyepitopic peptide. 

The embodiment of this example provides for the possibility that an as yet undiscovered aspect of immune 
20 system processing will apply to the native nested sequence and thereby facilitate the production of therapeutic or 
prophylactic immune response-inducing vaccine compositions. Additionally such an embodiment provides for the 
possibility of motif-bearing epitopes for an HLA makeup that is presently unknown. Furthermore, this embodiment 
(excluding an analoged embodiment) directs the immune response to multiple peptide sequences that are actually 
present in native 121P1F1, thus avoiding the need to evaluate any junctional epitopes. Lastly, the embodiment 
25 provides an economy of scale when producing peptide or nucleic acid vaccine compositions. 

Related to this embodiment, computer programs are available in the art which can be used to identify in a 
target sequence, the greatest number of epitopes per sequence length. 

Example 25: Polvepitopic Vaccine Compositions From Multiple Antigens 
30 The 121P1F1 peptide epitopes of the present invention are used in conjunction with epitopes from other 

target tumor-associated antigens, to create a vaccine composition that is useful for the prevention or treatment of 
cancer that expresses 1 2 1 P 1 F 1 and such other antigens. For example, a vaccine composition can be provided as a 
single polypeptide that incorporates multiple epitopes from 121P1F1 as well as tumor-associated antigens that are 
often expressed with a target cancer associated with 121P1F1 expression, or can be administered as a composition 
35 comprising a cocktail of one or more discrete epitopes. Alternatively, the vaccine can be administered as a 
minigene construct or as dendritic cells which have been loaded with the peptide epitopes in vitro. 

Example 26: Use of peptides to evaluate an immune response 

Peptides of the invention may be used to analyze an immune response for the presence of specific 
40 antibodies, CTL or HTL directed to 1 2 1 P 1 F 1 . Such an analysis can be performed in a manner described by Ogg et 
aL, Science 279:2103-2106, 1998. In this Example, peptides in accordance with the invention are used as a reagent 
for diagnostic or prognostic purposes, not as an immunogen. 
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In this example highly sensitive human leukocyte antigen tetrameric complexes ("tetramers") are used for 
a cross-sectional analysis of, for example, 121P1F1 HLA-A*0201 -specific CTL frequencies from HLA A*0201- 
positive individuals at different stages of disease or following immunization comprising an 121P1F1 peptide 
containing an A*0201 motif. Tetrameric complexes are synthesized as described (Musey et ai, N. Engl. J. Med. 
5 337:1267, 1997). Briefly, purified HLA heavy chain (A*0201 in this example) and p2-microglobulin are 
synthesized by means of a prokaryotic expression system. The heavy chain is modified by deletion of the 
transmembrane-cytosolic tail and COOH-terminal addition of a sequence containing a BirA enzymatic biotinylation 
site. The heavy chain, p2-microg!obulin, and peptide are refolded by dilution. The 45-kD refolded product is 
isolated by fast protein liquid chromatography and then biotinylated by BirA in the presence of biotin (Sigma, St. 
10 Louis, Missouri), adenosine 5' triphosphate and magnesium. Streptavidin-phycoerythrin conjugate is added in a 1 :4 
molar ratio, and the tetrameric product is concentrated to 1 mg/ml. The resulting product is referred to as tetramer- 
phycoerythrin. 

For the analysis of patient blood samples, approximately one million PBMCs are centrifuged at 300g for 5 
minutes and resuspended in 50 ul of cold phosphate-buffered saline. Tri-color analysis is performed with the 

15 tetramer-phycoerythrin, along with anti-CD8-Trico!or, and anti-CD38. The PBMCs are incubated with tetramer 
and antibodies on ice for 30 to 60 min and then washed twice before formaldehyde fixation. Gates are applied to 
contain >99.98% of control samples. Controls for the tetramers include both A*0201 -negative individuals and 
A*0201 -positive non-diseased donors. The percentage of cells stained with the tetramer is then determined by flow 
cytometry. The results indicate the number of cells in the PBMC sample that contain epitope-restricted CTLs, 

20 thereby readily indicating the extent of immune response to the 12 1 PI Fl epitope, and thus the status of exposure to 
121P1F1, or exposure to a vaccine that elicits a protective or therapeutic response. 

Example 27: Use of Peptide Epitopes to Evaluate Recall Responses 

The peptide epitopes of the invention are used as reagents to evaluate T cell responses, such as acute or 
25 recall responses, in patients. Such an analysis may be performed on patients who have recovered from 121P1F1- 
associated disease or who have been vaccinated with an 12 1 P 1 Fl vaccine. 

For example, the class I restricted CTL response of persons who have been vaccinated may be analyzed. 
The vaccine may be any 121P1F1 vaccine. PBMC are collected from vaccinated individuals and HLA typed. 
Appropriate peptide epitopes of the invention that, optimally, bear supermotifs to provide cross-reactivity with 
30 multiple HLA supertype family members, are then used for analysis of samples derived from individuals who bear 
that HLA type. 

PBMC from vaccinated individuals are separated on Ficoll-Histopaque density gradients (Sigma Chemical 
Co., St. Louis, MO), washed three times in HBSS (GIBCO Laboratories), resuspended in RPM1-1640 (GIBCO 
Laboratories) supplemented with L-glutamine (2mM), penicillin (50U/ml), streptomycin (50 Mg/ml), and Hepes 

35 (lOmM) containing 10% heat- inactivated human AB serum (complete RPMI) and plated using microculture 

formats. A synthetic peptide comprising an epitope of the invention is added at 10 ug/ml to each well and HBV 
core 128-140 epitope is added at 1 ug/ml to each well as a source of T cell help during the first week of stimulation. 

In the microculture format, 4 x 10 5 PBMC are stimulated with peptide in 8 replicate cultures in 96-well 
round bottom plate in 100 ul/well of complete RPMI. On days 3 and 10, 100 ul of complete RPMI and 20 U/ml 

40 final concentration of rIL-2 are added to each well. On day 7 the cultures are transferred into a 96-well flat-bottom 
plate and restimulated with peptide, rIL-2 and 10 5 irradiated (3,000 rad) autologous feeder cells. The cultures are 
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tested for cytotoxic activity on day 14. A positive CTL response requires two or more of the eight replicate cultures 
to display greater than 10% specific 5l Cr release, based on comparison with non-diseased control subjects as 
previously described (Rehermann, et ai, Nature Med. 2:1 104,1 108, 1996; Rehermann et ai, J. Clin. Invest. 
97:1655-1665, 1996; and Rehermann et ai J. Clin. Invest. 98:1432-1440, 1996). 
5 Target cell lines are autologous and allogeneic EBV-transformed B-LCL that are either purchased from the 

American Society for Histocompatibility and Immunogenetics (ASHI, Boston, MA) or established from the pool of 
patients as described (Guilhot, et ai J. Virol. 66:2670-2678, 1992). 

Cytotoxicity assays are performed in the following manner. Target cells consist of either allogeneic HLA- 
matched or autologous EBV-transformed B lymphoblastoid cell line that are incubated overnight with the synthetic 
10 peptide epitope of the invention at 10 uM, and labeled with 100 uCi of 5, Cr (Amersham Corp., Arlington Heights, 
IL) for 1 hour after which they are washed four times with HBSS. 

Cytolytic activity is determined in a standard 4-h, split well 51 Cr release assay using U-bottomed 96 well 
plates containing 3,000 targets/well. Stimulated PBMC are tested at effector/target (E/T) ratios of 20-50:1 on day 
14. Percent cytotoxicity is determined from the formula: 100 x [(experimental release-spontaneous 
15 release)/maximum release-spontaneous release)]. Maximum release is determined by lysis of targets by detergent 
(2% Triton X-100; Sigma Chemical Co., St. Louis, MO). Spontaneous release is <25% of maximum release for all 
experiments. 

The results of such an analysis indicate the extent to which HLA-restricted CTL populations have been 

stimulated by previous exposure to 12 1 PI Fl or an 12 1 PI Fl vaccine. 
20 Similarly, Class II restricted HTL responses may also be analyzed. Purified PBMC are cultured in a 96- 

well flat bottom plate at a density of 1.5x1 0 5 cells/well and are stimulated with 10 ug/ml synthetic peptide of the 

invention, whole 121P1F1 antigen, or PHA. Cells are routinely plated in replicates of 4-6 wells for each condition. 

After seven days of culture, the medium is removed and replaced with fresh medium containing lOU/ml IL-2. Two 

days later, 1 uCi 3 H-thymidine is added to each well and incubation is continued for an additional 18 hours. 
25 Cellular DNA is then harvested on glass fiber mats and analyzed for 3 H-thymidine incorporation. Antigen-specific 

T cell proliferation is calculated as the ratio of 3 H-thymidine incorporation in the presence of antigen divided by the 

3 H-thymidine incorporation in the absence of antigen. 

Example 28: Induction Of Specific CTL Response In Humans 
30 A human clinical trial for an immunogenic composition comprising CTL and HTL epitopes of the 

invention is set up as an IND Phase I, dose escalation study and carried out as a randomized, double-blind, placebo- 
controlled trial. Such a trial is designed, for example, as follows: 

A total of about 27 individuals are enrolled and divided into 3 groups: 

Group I: 3 subjects are injected with placebo and 6 subjects are injected with 5 ug of peptide composition; 
35 Group II: 3 subjects are injected with placebo and 6 subjects are injected with 50 u£ peptide composition; 

Group III: 3 subjects are injected with placebo and 6 subjects are injected with 500 jig of peptide 
composition. 

After 4 weeks following the first injection, all subjects receive a booster inoculation at the same dosage. 

The endpoints measured in this study relate to the safety and tolerability of the peptide composition as well 
40 as its immunogenicity. Cellular immune responses to the peptide composition are an index of the intrinsic activity 
of this the peptide composition, and can therefore be viewed as a measure of biological efficacy. The following 
summarize the clinical and laboratory data that relate to safety and efficacy endpoints. 
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Safety: The incidence of adverse events is monitored in the placebo and drug treatment group and 
assessed in terms of degree and reversibility. 

Evaluation of Vaccine Efficacy: For evaluation of vaccine efficacy, subjects are bled before and after 
injection. Peripheral blood mononuclear cells are isolated from fresh heparinized blood by Ficoll-Hypaque density 
5 gradient centrifugation, aliquoted in freezing media and stored frozen. Samples are assayed for CTL and HTL 
activity. 

The vaccine is found to be both safe and efficacious. 

Example 29: Phase II Trials In Patients Expressing 121P1F1 

10 Phase II trials are performed to study the effect of administering the CTL-HTL peptide compositions to 

patients having cancer that expresses 1 2 1 P 1 F 1 . The main objectives of the trial are to determine an effective dose 
and regimen for inducing CTLs in cancer patients that express 12 1 PI Fl , to establish the safety of inducing a CTL 
and HTL response in these patients, and to see to what extent activation of CTLs improves the clinical picture of 
these patients, as manifested, e.g., by the reduction and/or shrinking of lesions. Such a study is designed, for 

1 5 example, as follows: 

The studies are performed in multiple centers. The trial design is an open-label, uncontrolled, dose 
escalation protocol wherein the peptide composition is administered as a single dose followed six weeks later by a 
single booster shot of the same dose. The dosages are 50, 500 and 5,000 micrograms per injection. Drug- 
associated adverse effects (severity and reversibility) are recorded. 

20 There are three patient groupings. The first group is injected with 50 micrograms of the peptide 

composition and the second and third groups with 500 and 5,000 micrograms of peptide composition, respectively. 
The patients within each group range in age from 21-65 and represent diverse ethnic backgrounds. All of them 
have a tumor that expresses 1 2 1 P 1 F 1 . 

Clinical manifestations or antigen-specific T-cell responses are monitored to assess the effects of 

25 administering the peptide compositions. The vaccine composition is found to be both safe and efficacious in the 
treatment of 121PlFl-associated disease. 

Example 30: Induction of CTL Responses Using a Prime Boost Protocol 

A prime boost protocol similar in its underlying principle to that used to confirm the efficacy of a DNA 
30 vaccine in transgenic mice, such as described above in the Example entitled "The Plasmid Construct and the Degree 
to Which It Induces Immunogenicity," can also be used for the administration of the vaccine to humans. Such a 
vaccine regimen can include an initial administration of, for example, naked DNA followed by a boost using 
recombinant virus encoding the vaccine, or recombinant protein/poly peptide or a peptide mixture administered in an 
adjuvant. 

35 For example, the initial immunization may be performed using an expression vector, such as that 

constructed in the Example entitled "Construction of 'Minigene' Multi-Epitope DNA Plasmids" in the form of 
naked nucleic acid administered IM (or SC or ID) in the amounts of 0.5-5 mg at multiple sites. The nucleic acid 
(0.1 to 1000 ug) can also be administered using a gene gun. Following an incubation period of 3-4 weeks, a booster 
dose is then administered. The booster can be recombinant fowlpox virus administered at a dose of 5-1 0 7 to 5xl0 9 

40 pfii. An alternative recombinant virus, such as an MVA, canarypox, adenovirus, or adeno-associated virus, can also 
be used for the booster, or the polyepitopic protein or a mixture of the peptides can be administered. For evaluation 
of vaccine efficacy, patient blood samples are obtained before immunization as well as at intervals following 
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administration of the initial vaccine and booster doses of the vaccine. Peripheral blood mononuclear cells are 
isolated from fresh heparinized blood by Ficoll-Hypaque density gradient centrifugation, aliquoted in freezing 
media and stored frozen. Samples are assayed for CTL and HTL activity. 

Analysis of the results indicates that a magnitude of response sufficient to achieve a therapeutic or 
5 protective immunity against 1 2 1 P 1 F 1 is generated. 

Example 31 : Administration of Vaccine Compositions Using Dendritic Cells (DC) 

Vaccines comprising peptide epitopes of the invention can be administered using APCs, or "professional" 
APCs such as DC. In this example, peptide-pulsed DC are administered to a patient to stimulate a CTL response in 

10 vivo. In this method, dendritic cells are isolated, expanded, and pulsed with a vaccine comprising peptide CTL and 
HTL epitopes of the invention. The dendritic cells are infused back into the patient to elicit CTL and HTL 
responses in vivo. The induced CTL and HTL then destroy or facilitate destruction, respectively, of the target cells 
that bear the 121 PI Fl protein from which the epitopes in the vaccine are derived. 

For example, a cocktail of epitope-comprising peptides is administered ex vivo to PBMC, or isolated DC 

1 5 therefrom. A pharmaceutical to facilitate harvesting of DC can be used, such as Progenipoietin™ (Monsanto, St. 
Louis, MO) or GM-CSF/IL-4. After pulsing the DC with peptides, and prior to reinfusion into patients, the DC are 
washed to remove unbound peptides. 

As appreciated clinically, and readily determined by one of skill based on clinical outcomes, the number of 
DC reinfused into the patient can vary (see, e.g., Nature Med 4:328, 1998; Nature Med. 2:52, 1996 and Prostate 

20 32:272, 1997). Although 2-50 x 10 6 DC per patient are typically administered, larger number of DC, such as 10 7 or 
10 8 can also be provided. Such cell populations typically contain between 50-90% DC. 

In some embodiments, peptide-loaded PBMC are injected into patients without purification of the DC. For 
example, PBMC generated after treatment with an agent such as Progenipoietin™ are injected into patients without 
purification of the DC. The total number of PBMC that are administered often ranges from 10 8 to 10 10 . Generally, 

25 the cell doses injected into patients is based on the percentage of DC in the blood of each patient, as determined, for 
example, by immunofluorescence analysis with specific anti-DC antibodies. Thus, for example, if Progenipoietin™ 
mobilizes 2% DC in the peripheral blood of a given patient, and that patient is to receive 5 x 10 6 DC, then the 
patient will be injected with a total of 2.5 x 10 8 peptide-loaded PBMC. The percent DC mobilized by an agent such 
as Progenipoietin™ is typically estimated to be between 2-10%, but can vary as appreciated by one of skill in the 

30 art. 

Ex vivo activation of CTL/HTL responses 

Alternatively, ex vivo CTL or HTL responses to 121 PI Fl antigens can be induced by incubating, in tissue 
culture, the patient's, or genetically compatible, CTL or HTL precursor cells together with a source of APC, such as 
DC, and immunogenic peptides. After an appropriate incubation time (typically about 7-28 days), in which the 
35 precursor cells are activated and expanded into effector cells, the cells are infused into the patient, where they will 
destroy (CTL) or facilitate destruction (HTL) of their specific target cells, i.e., tumor cells. 

Example 32: An Alternative Method of Identifying and Confirming Mo tif-Bearing Peptides 
Another method of identifying and confirming motif-bearing peptides is to elute them from cells bearing 
40 defined MHC molecules. For example, EBV transformed B cell lines used for tissue typing have been extensively 
characterized to determine which HLA molecules they express. In certain cases these cells express only a single 
type of HLA molecule. These cells can be transfected with nucleic acids that express the antigen of interest, e.g. 
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12 1P1F1 . Peptides produced by endogenous antigen processing of peptides produced as a result of transfection will 
then bind to HLA molecules within the cell and be transported and displayed on the cell's surface. Peptides are 
then eluted from the HLA molecules by exposure to mild acid conditions and their amino acid sequence 
determined, e.g., by mass spectral analysis (e.g., Kubo et ai., J. Immunol. 152:3913, 1994). Because the majority of 
peptides that bind a particular HLA molecule are motif-bearing, this is an alternative modality for obtaining the 
motif-bearing peptides correlated with the particular HLA molecule expressed on the cell. 

Alternatively, cell lines that do not express endogenous HLA molecules can be transfected with an 
expression construct encoding a single HLA allele. These cells can then be used as described, i.e., they can then be 
transfected with nucleic acids that encode 121 PI Fl to isolate peptides corresponding to 121 PI Fl that have been 
presented on the cell surface. Peptides obtained from such an analysis will bear motif(s) that correspond to binding 
to the single HLA allele that is expressed in the cell. 

As appreciated by one in the art, one can perform a similar analysis on a cell bearing more than one HLA 
allele and subsequently determine peptides specific for each HLA allele expressed. Moreover, one of skill would 
also recognize that means other than transfection, such as loading with a protein antigen, can be used to provide a 
source of antigen to the cell. 

Example 33: Complementary Polynucleotides 

Sequences complementary to the 121 PI Fl -encoding sequences, or any parts thereof, are used to detect, 
decrease, or inhibit expression of naturally occurring 121P1F1. Although use of oligonucleotides comprising from 
about 15 to 30 base pairs is described, essentially the same procedure is used with smaller or with larger sequence 
fragments. Appropriate oligonucleotides are designed using, e.g., OLIGO 4.06 software (National Biosciences) and 
the coding sequence of 12 1 PI Fl . To inhibit transcription, a complementary oligonucleotide is designed from the 
most unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to a 121PlFl-encoding transcript. 

Example 34: Purification of Naturally-occurring or Recombinant 121P1F1 Usi ng 121P1F1 Specific 
Antibodies 

Naturally occurring or recombinant 121P1F1 is substantially purified by immunoaffinity chromatography 
using antibodies specific for 121 PI Fl. An immunoaffinity column is constructed by covalently coupling anti- 
121P1F1 antibody to an activated chromatographic resin, such as CNBr-activated SEPH AROSE (Amersham 
Pharmacia Biotech). After the coupling, the resin is blocked and washed according to the manufacturer's 
instructions. 

Media containing 12 1 PI Fl are passed over the immunoaffinity column, and the column is washed under 
conditions that allow the preferential absorbance of 121 PI Fl (e.g., high ionic strength buffers in the presence of 
detergent). The column is eluted under conditions that disrupt antibody/121 PI Fl binding (e.g., a buffer of pH 2 to 
pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and GCR.P is collected. 

Example 35: Identification of Molecules Which Interact with 121P1F1 

121P1F1, or biologically active fragments thereof, are labeled with 121 1 Bolton-Hunter reagent. (See, 
e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate molecules previously arrayed in the wells of a multi-well 
plate are incubated with the labeled 12 1 PI Fl, washed, and any wells with labeled 121P1F1 complex are assayed. 
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Data obtained using different concentrations of 121 PI Fl are used to calculate values for the number, affinity, and 
association of 121 PI Fl with the candidate molecules. 

5 Example 36: In Vivo Assay for 121P1F1 Tumor Growth Promotion 

The effect of the 121P1F1 protein on tumor cell growth is evaluated in vivo by evaluating tumor 
development and growth of cells expressing or lacking 121P1F1. For example, SC1D mice are injected 
subcutaneously on each flank with 1 x 10 6 of either 3T3, prostate, kidney or breast cancer cell lines (e.g. PC3, 
DU145, CaKi, SW 839, MCF7 cells) containing tkNeo empty vector or 121P1F1. At least two strategies can be 

10 used: (1) Constitutive 121P1F1 expression under regulation of a promoter, such as a constitutive promoter obtained 
from the genomes of viruses such as polyoma virus, fowlpox virus (see UK 2,21 1,504, published 5 July 1989), 
adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, 
hepatitis-B virus and Simian Virus 40 (SV40), or from heterologous mammalian promoters, e.g., the actin promoter 
or an immunoglobulin promoter, provided such promoters are compatible with the host cell systems, and (2) 

1 5 Regulated expression under control of an inducible vector system, such as ecdysone, tetracycline, etc., provided 
such promoters are compatible with the host cell systems. Tumor volume is then monitored by caliper 
measurement at the appearance of palpable tumors and followed over time to determine if 1 2 1 P 1 F I -expressing cells 
grow at a faster rate and whether tumors produced by 121 PI Fl -expressing cells demonstrate characteristics of 
altered aggressiveness (e.g. enhanced metastasis, vascularization, reduced responsiveness to chemotherapeutic 

20 drugs). 

Additionally, mice can be implanted with 1 x 10 5 of the same cells orthotopically to determine if 121 PI Fl 
has an effect on local growth in the prostate, kidney or mammary gland, and whether 121P1F1 affects the ability of 
the cells to metastasize, specifically to lungs, lymph nodes, and bone marrow. 

The assay is also useful to determine the 121 PI Fl inhibitory effect of candidate therapeutic compositions, 
25 such as for example, 121P1F1 intrabodies, 121P1F1 antisense molecules and ribozymes. 

Example 37: 121P1F1 Monoclonal Antibody-mediated Inhibition of Pro state and Kidney Tumors In 

Vivo 

The significant expression of 121 PI Fl in cancer tissues, together with its restrictive expression in normal 
30 tissues, makes 121P1F1 a good target for antibody therapy. Similarly, 121P1F1 is a target for T cell-based 

immunotherapy. Thus, the therapeutic efficacy of anti-121PlFl mAbs in human prostate cancer xenograft mouse 
models is evaluated by using androgen-independent LAPC-4 and LAPC-9 xenografts (Craft, N., et al.,. Cancer Res, 
1999. 59(19): p. 5030-6) the androgen independent recombinant cell line PC3-121P1F1 and 3T3-121P1F1 (see, 
e.g., Kaighn, M.E., et ai, Invest Urol, 1979. 17(1): p. 16-23). Similarly, anti-121PlFl mAbs are evaluated in 
35 human kidney cancer xenograft models such as AGS-K3 and AGS-K6 and in recombinant kidney cell lines such as 
CaKi-121PlFl. 

Antibody efficacy on tumor growth and metastasis formation is studied, e.g., in a mouse orthotopic 
prostate cancer xenograft models and mouse kidney xenograft models. The antibodies can be unconjugated, as 
discussed in this Example, or can be conjugated to a therapeutic modality, as appreciated in the art. Anti-121P1F1 
40 mAbs inhibit formation of both the androgen-dependent LAPC-9 and androgen-independent PC3- 1 2 1 P 1 F 1 tumor 
xenografts. Anti-121P1F1 mAbs also retard the growth of established orthotopic tumors and prolonged survival of 
tumor-bearing mice. These results indicate the utility of anti-!21PlFl mAbs in the treatment of local and advanced 
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stages of prostate cancer. (See, e.g., Saffran, D., et al., PNAS 10:1073-1078 or on the World Wide Web at 
www.pnas.org/cgi/doi/10.1073/pnas.051624698). Similarly, anti-121PlFl mAbs can inhibit formation of AGS-K3 
and AGS-K6 tumors in SCID mice, and prevent or retard the growth of CaKi-121Pl Fl tumor xenografts. These 
results indicate utility of anti-121PlFl mAbs for treatment of kidney cancer. 
5 Administration of the anti-121PlFl mAbs leads to retardation of established orthotopic tumor growth and 

inhibition of metastasis to distant sites, resulting in a significant prolongation in the survival of tumor-bearing mice. 
These studies indicate that 121P1F1 as an attractive target for immunotherapy and demonstrate the therapeutic 
potential of anti-121PlFl mAbs for the treatment of local and metastatic prostate cancer. This example 
demonstrates that unconjugated 121P1F1 monoclonal antibodies are effective to inhibit the growth of human 
10 prostate tumor xenografts and human kidney xenografts grown in SCID mice; accordingly a combination of such 
efficacious monoclonal antibodies is also effective. 

Tumor inhibition using multiple unconjugated 121P1F1 mAbs 
Materials and Methods 
15 121P1F1 Monoclonal Antibodies: 

Monoclonal antibodies are raised against 121P1F1 as described in the Example entitled "Generation of 
121P1F1 Monoclonal Antibodies (mAbs)." The antibodies are characterized by ELISA, Western blot, FACS, and 
immunoprecipitation for their capacity to bind 1 2 1 P 1 F 1 . Epitope mapping data for the anti- 1 2 1 P 1 F 1 mAbs, as 
determined by ELISA and Western analysis, recognize epitopes on the 121 PI Fl protein. Immunohistochemical 
20 analysis of prostate cancer tissues and cells with these antibodies is performed. 

The monoclonal antibodies are purified from ascites or hybridoma tissue culture supernatants by Protein-G 
Sepharose chromatography, dialyzed against PBS, filter sterilized, and stored at -20°C. Protein determinations are 
performed by a Bradford assay (Bio-Rad, Hercules, CA). A therapeutic monoclonal antibody or a cocktail 
comprising a mixture of individual monoclonal antibodies is prepared and used for the treatment of mice receiving 
25 subcutaneous or orthotopic injections of LAPC-9 prostate tumor xenografts. 
Cancer Xenografts and Cell Lines 

The LAPC-9 xenograft, which expresses a wild-type androgen receptor and produces prostate-specific 
antigen (PSA), is passaged in 6- to 8-week-old male ICR-severe combined immunodeficient (SCID) mice (Taconic 
Farms) by s.c. trocar implant (Craft, N., et aL, supra). The AGS-K3 and AGS-K6 kidney xenografts are also 
30 passaged by subcutaneous implants in 6- to 8- week old SCID mice. Single-cell suspensions of tumor cells are 

prepared as described in Craft, et al. The prostate carcinoma cell line PC3 (American Type Culture Collection) is 
maintained in RPMI supplemented with L-glutamine and 10% FBS, and the kidney carcinoma line CaKi as well as 
NIH-3T3 cells (American Type Culture Collection) are maintained in DMEM supplemented with L-glutamine and 
10% FBS. 

35 A PC3- 1 2 1 P 1 F 1 , CaKi- 1 2 1 P 1 F 1 and 3T3- 1 2 1 P 1 F 1 cell populations are generated by retroviral gene 

transfer as described in Hubert, R.S., et al., STEAP: a prostate-specific cell-surface antigen highly expressed in 
human prostate tumors. Proc Natl Acad Sci USA, 1999. 96(25): p. 14523-8. 
Xenograft Mouse Models. 

Subcutaneous (s.c.) tumors are generated by injection of 1 x 10 6 LAPC-9, AGS-K3, AGS-K6, PC3, PC3- 
40 121P1F1, CaKi or CaKi- 121 PI Fl cells mixed at a 1:1 dilution with Matrigel (Collaborative Research) in the right 
flank of male SCID mice. To test antibody efficacy on tumor formation, i.p. antibody injections are started on the 
same day as tumor-cell injections. As a control, mice are injected with either purified mouse IgG (ICN) or PBS; or 
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a purified monoclonal antibody that recognizes an irrelevant antigen not expressed in human cells. In preliminary 
studies, no difference is found between mouse IgG or PBS on tumor growth. Tumor sizes are determined by 
vernier caliper measurements, and the tumor volume is calculated as length x width x height. Mice with s.c. tumors 
greater than 1 .5 cm in diameter are sacrificed. PSA levels are determined by using a PSA ELISA kit (Anogen, 
5 Mississauga, Ontario). Circulating levels of anti- 1 2 1 P 1 F 1 mAbs are determined by a capture ELISA kit (Bethy 1 
Laboratories, Montgomery, TX). (See, e.g., (Saffran, D., et al., PNAS 10:1073-1078 o r on the World Wide Web at 
www.pnas.org/cgi/ doi/1 0. 1 073/pnas.05 1 624698) 

Orthotopic injections are performed under anesthesia by using ketamine/xylazine. For prostate orthotopic 
studies, an incision is made through the abdominal muscles to expose the bladder and seminal vesicles, which then 

10 are delivered through the incision to expose the dorsal prostate. LAPC-9 cells (5 x 10 5 ) mixed with Matrigel are 
injected into each dorsal lobe in a 10-ul volume. To monitor tumor growth, mice are bled on a weekly basis for 
determination of PSA levels. For kidney orthotopic models, an incision is made through the abdominal muscles to 
expose the kidney. AGS-K3 or AGS-K6 cells mixed with Matrigel are injected under the kidney capsule. The mice 
are segregated into groups for the appropriate treatments, with anti- 121 PI Fl or control mAbs being injected i.p. 

15 Anti- 1 2 1 P 1 F 1 mAbs Inhibit Growth of 1 2 1 P 1 F 1 -Expressing Xenoeraft-Cancer Tumors 

The effect of anti-121PlFl mAbs on tumor formation is tested by using LAPC-9 and AGS-K3 orthotopic 
models. As compared with the s.c. tumor model, the orthotopic model, which requires injection of tumor cells 
directly in the mouse prostate or kidney, respectively, results in local tumor growth, development of metastasis in 
distal sites, deterioration of mouse health, and subsequent death (Saffran, D., et al., PNAS supra; Fu, X., et al., Int J 

20 Cancer, 1992. 52(6): p. 987-90; Kubota, T., J Cell Biochem, 1994. 56(1): p. 4-8). The features make the orthotopic 
model more representative of human disease progression and allow the therapeutic effect of mAbs on clinically 
relevant end points to be followed. 

Accordingly, tumor cells are injected into the mouse prostate or kidney, and 2 days later, the mice are 
segregated into two groups and treated with either: a) 200-500ug of anti-121PlFl Ab, or b) PBS three times per 

25 week for two to five weeks. 

A major advantage of the orthotopic prostate-cancer model is the ability to study the development of 
metastases. Formation of metastasis in mice bearing established orthotopic tumors is studied by IHC analysis on 
lung sections using an antibody against a prostate-specific cell-surface protein STEAP expressed at high levels in 
LAPC-9 xenografts (Hubert, R.S., et al, Proc Natl Acad Sci USA, 1999. 96(25): p. 14523-8) or anti-G250 

30 antibody for kidney cancer models. 

Mice bearing established orthotopic LAPC-9 tumors are administered lOOOug injections of either anti- 
121P1F1 mAb or PBS over a 4-week period. Mice in both groups are allowed to establish a high tumor burden 
(PSA levels greater than 300 ng/ml), to ensure a high frequency of metastasis formation in mouse lungs. Mice then 
are killed and their prostate/kideny and lungs are analyzed for the presence of tumor cells by IHC analysis. 

35 These studies demonstrate a broad anti-tumor efficacy of anti- 1 2 1 P 1 F 1 antibodies on initiation and 

progression of prostate and kidney cancer in xenograft mouse models. Anti- 121 PI Fl antibodies inhibit tumor 
formation of both androgen-dependent and androgen-independent tumors, retard the growth of already established 
tumors, and prolong the survival of treated mice. Moreover, anti- 121 P1F1 mAbs demonstrate a dramatic inhibitory 
effect on the spread of local prostate tumor to distal sites, even in the presence of a large tumor burden. Thus, anti- 

40 121P1F1 mAbs are efficacious on major clinically relevant end points (tumor growth), prolongation of survival, and 
health. 
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Example 38: Therapeutic and Diagnostic use of Anti-121 PI Fl Antibodies in Humans. 

Anti-121P1F1 monoclonal antibodies are safely and effectively used for diagnostic, prophylactic, 
prognostic and/or therapeutic purposes in humans. Western blot and immunohistochemical analysis of cancer 
tissues and cancer xenografts with anti-121 PI Fl mAb show strong extensive staining in carcinoma but significantly 
lower or undetectable levels in normal tissues. Detection of 121 PI Fl in carcinoma and in metastatic disease 
demonstrates the usefulness of the mAb as a diagnostic and/or prognostic indicator. Anti-121 PI Fl antibodies are 
therefore used in diagnostic applications such as immunohistochemistry of kidney biopsy specimens to detect 
cancer from suspect patients. 

As determined by flow cytometry, anti-121PlFl mAb specifically binds to carcinoma cells. Thus, anti- 
121P1F1 antibodies are used in diagnostic whole body imaging applications, such as radioimmunoscintigraphy and 
radioimmunotherapy, (see, e.g., Potamianos S., et. al. Anticancer Res 20(2A):925-948 (2000)) for the detection of 
localized and metastatic cancers that exhibit expression of 1 2 1 P 1 Fl . Shedding or release of an extracellular domain 
of 121 PI Fl into the extracellular milieu, such as that seen for alkaline phosphodiesterase B10 (Meerson, N. R., 
Hepatology 27:563-568 (1998)), allows diagnostic detection of 121P1F1 by anti-121PlFl antibodies in serum 
and/or urine samples from suspect patients. 

Anti-121 PI Fl antibodies that specifically bind 121P1F1 are used in therapeutic applications for the 
treatment of cancers that express 121P1F1. Anti-121P1F1 antibodies are used as an unconjugated modality and as 
conjugated form in which the antibodies are attached to one of various therapeutic or imaging modalities well 
known in the art, such as a prodrugs, enzymes or radioisotopes. In preclinical studies, unconjugated and conjugated 
anti-121 PI Fl antibodies are tested for efficacy of tumor prevention and growth inhibition in the SCID mouse 
cancer xenograft models, e.g., kidney cancer models AGS-K3 and AGS-K6, (see, e.g., the Example entitled 
"Monoclonal Antibody-mediated Inhibition of Prostate and Kidney Tumors In v/vo." Conjugated and unconjugated 
anti-121PlFl antibodies are used as a therapeutic modality in human clinical trials either alone or in combination 
with other treatments as described in following Examples. 

Example 39: Human Clinical Trials for the Treatment and Diagnosis of Hu man Carcinomas 
through use of Human Anti-121P1F1 Antibodies In vivo 

Antibodies are used in accordance with the present invention which recognize an epitope on 121 PI Fl, and 
are used in the treatment of certain tumors such as those listedin Table I. Based upon a number of factors, including 
121P1F1 expression levels, tumors such as those listed in Table I are presently preferred indications. In connection 
with each of these indications, three clinical approaches are successfully pursued. 

I. ) Adjunctive therapy: In adjunctive therapy, patients are treated with anti-121PlFl antibodies in 
combination with a chemotherapeutic or antineoplastic agent and/or radiation therapy. Primary cancer targets, such 
as those listed in Table I, are treated under standard protocols by the addition anti-121 PI Fl antibodies to standard 
first and second line therapy. Protocol designs address effectiveness as assessed by reduction in tumor mass as well 
as the ability to reduce usual doses of standard chemotherapy. These dosage reductions allow additional and/or 
prolonged therapy by reducing dose-related toxicity of the chemotherapeutic agent. Anti-121P1F1 antibodies are 
utilized in several adjunctive clinical trials in combination with the chemotherapeutic or antineoplastic agents 
adriamycin (advanced prostrate carcinoma), cisplatin (advanced head and neck and lung carcinomas), taxol (breast 
cancer), and doxorubicin (preclinical). 

II. ) Monotherapy: In connection with the use of the anti-121 PI Fl antibodies in monotherapy of 
rumors, the antibodies are administered to patients without a chemotherapeutic or antineoplastic agent. In one 
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embodiment, monotherapy is conducted clinically in end stage cancer patients with extensive metastatic disease. 
Patients show some disease stabilization. Trials demonstrate an effect in refractory patients with cancerous tumors. 

III.) Imaging Agent: Through binding a radionuclide (e.g., iodine or yttrium (I 131 , Y 90 ) to anti- 
121P1F1 antibodies, the radiolabeled antibodies are utilized as a diagnostic and/or imaging agent. In such a role, 
the labeled antibodies localize to both solid tumors, as well as, metastatic lesions of cells expressing 121P1F1. In 
connection with the use of the anti-121PlFl antibodies as imaging agents, the antibodies are used as an adjunct to 
surgical treatment of solid tumors, as both a pre-surgical screen as well as a post-operative follow-up to determine 
what tumor remains and/or returns. In one embodiment, a ( m In)-121P1F1 antibody is used as an imaging agent in 
a Phase 1 human clinical trial in patients having a carcinoma that expresses 121P1F1 (by analogy see, e.g., Divgi et 
al J. Natl. Cancer Inst. 83:97-104 (1991)). Patients are followed with standard anterior and posterior gamma 
camera. The results indicate that primary lesions and metastatic lesions are identified 

Dose and Route of Administration 

As appreciated by those of ordinary skill in the art, dosing considerations can be determined through 
comparison with the analogous products that are in the clinic. Thus, anti-121PlFl antibodies can be administered 
with doses in the range of 5 to 400 mg/m 2 , with the lower doses used, e.g., in connection with safety studies. The 
affinity of anti-121PlFl antibodies relative to the affinity of a known antibody for its target is one parameter used 
by those of skill in the art for determining analogous dose regimens. Further, anti-121PlFl antibodies that are fully 
human antibodies, as compared to the chimeric antibody, have slower clearance; accordingly, dosing in patients 
with such fully human anti-121PlFl antibodies can be lower, perhaps in the range of 50 to 300 mg/m 2 , and still 
remain efficacious. Dosing in mg/m 2 , as opposed to the conventional measurement of dose in mg/kg, is a 
measurement based on surface area and is a convenient dosing measurement that is designed to include patients of 
all sizes from infants to adults. 

Three distinct delivery approaches are useful for delivery of anti-121PlFl antibodies. Conventional 
intravenous delivery is one standard delivery technique for many tumors. However, in connection with tumors in 
the peritoneal cavity, such as tumors of the ovaries, biliary duct, other ducts, and the like, intraperitoneal 
administration may prove favorable for obtaining high dose of antibody at the tumor and to also minimize antibody 
clearance. In a similar manner, certain solid tumors possess vasculature that is appropriate for regional perfusion. 
Regional perfusion allows for a high dose of antibody at the site of a tumor and minimizes short term clearance of 
the antibody. 

Clinical Development Plan (CDP) 

Overview: The CDP follows and develops treatments of anti-121PlFl antibodies in connection with 
adjunctive therapy, monotherapy, and as an imaging agent. Trials initially demonstrate safety and thereafter 
confirm efficacy in repeat doses. Trails are open label comparing standard chemotherapy with standard therapy 
plus anti-121PlFl antibodies. As will be appreciated, one criteria that can be utilized in connection with enrollment 
of patients is 12 1 PI Fl expression levels in their tumors as determined by biopsy. 

As with any protein or antibody infusion-based therapeutic, safety concerns are related primarily to (i) 
cytokine release syndrome, i.e., hypotension, fever, shaking, chills; (ii) the development of an immunogenic 
response to the material (i.e., development of human antibodies by the patient to the antibody therapeutic, or HAH A 
response); and, (iii) toxicity to normal cells that express 12 1 PI Fl . Standard tests and follow-up are utilized to 
monitor each of these safety concerns. Anti-121P1F1 antibodies are found to be safe upon human administration. 



99 



511582003440 



Example 40: Human Clinical Trial Adjunctive Therapy with Human Anti-121P1F1 Antibody 
and Chemotherapeutic Agent 

A phase I human clinical trial is initiated to assess the safety of six intravenous doses of a human anti- 
121P1F1 antibody in connection with the treatment of a solid tumor, e.g., a cancer of a tissue listed in Table I. In 
the study, the safety of single doses of anti-121PlFl antibodies when utilized as an adjunctive therapy to an 
antineoplastic or chemotherapeutic agent, such as cisplatin, topotecan, doxorubicin, adriamycin, taxol, or the like, is 
assessed. The trial design includes delivery of six single doses of an anti-121PlFl antibody with dosage of 
antibody escalating from approximately about 25 mg/m 2 to about 275 mg/m 2 over the course of the treatment in 
accordance with the following schedule: 

Day 0 Day 7 Day 14 Day 2 1 Day 28 Day 35 

mAbDose 25 75 125 175 225 275 

mg/m 2 mg/m 2 mg/m 2 mg/m 2 mg/m 2 mg/m 2 
Chemotherapy + + + + + + 
(standard dose) 

Patients are closely followed for one-week following each administration of antibody and chemotherapy. 
In particular, patients are assessed for the safety concerns mentioned above: (i) cytokine release syndrome, i.e., 
hypotension, fever, shaking, chills; (ii) the development of an immunogenic response to the material (i.e., 
development of human antibodies by the patient to the human antibody therapeutic, or HAH A response); and, (iii) 
toxicity to normal cells that express 121P1F1. Standard tests and follow-up are utilized to monitor each of these 
safety concerns. Patients are also assessed for clinical outcome, and particularly reduction in tumor mass as 
evidenced by MR1 or other imaging. 

The anti-121PlFl antibodies are demonstrated to be safe and efficacious, Phase II trials confirm the 
efficacy and refine optimum dosing. 

Example 41: Human Clinical Trial: Monotherapy with Human Anti-121P1 F1 Antibody 

Anti-121P1F1 antibodies are safe in connection with the above-discussed adjunctive trial, a Phase II 
human clinical trial confirms the efficacy and optimum dosing for monotherapy. Such trial is accomplished, and 
entails the same safety and outcome analyses, to the above-described adjunctive trial with the exception being that 
patients do not receive chemotherapy concurrently with the receipt of doses of anti-121PlFl antibodies. 

Example 42: Human Clinical Trial: Diagnostic Imaging with Anti-1 21P1F1 Antibody 

Once again, as the adjunctive therapy discussed above is safe within the safety criteria discussed above, a 
human clinical trial is conducted concerning the use of anti- 1 2 1 P 1 F 1 antibodies as a diagnostic imaging agent. The 
protocol is designed in a substantially similar manner to those described in the art, such as in Divgi et al J. Natl 
Cancer Inst. 83:97-104 (1991). The antibodies are found to be both safe and efficacious when used as a diagnostic 
modality. 
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Example 43: Homology Comparison of 121P1F1 to Known Sequences 

The 121P1F1 gene is identical to a previously cloned and sequenced gene, namely human GAJ protein 
(gi|14149769) showing 100% identity to that protein. The closest homolog to the 121P2F1 protein is a mouse 
hypothetical 24.2kDa protein (gi| 12847934) of unknown function. The 121P1F1 protein consists of 205 amino 
5 acids, with calculated molecular weight of 23.7 kDa, and pi of 8.2. 121P1F1 is an intracellular protein, with 

primary localization to the nucleus. 12 1 PI Fl can also localize to the cytosol. Motif analysis revealed the presence 
of a basic leucine zipper motif (bZIP) (Table XXI) in 121P1F1 at amino acids 1 17-143, and a steroid hormone 
receptor signature at aa 168-189. The basic-leucine zipper (bZIP) (Table XXI) motif mediates sequence-specific 
DNA-binding and dimerization of leucine zipper motifs with other basic helix-loop-helix proteins (Alber T. Curr 

10 Opin Genet Dev. 1992, 2:205). This dimerization of the transcription factor is critical in order for DNA binding 
and transcriptional activation to occur. Members of the leucine zipper family of proteins include the Myc proto- 
oncogene (Amati B,et al. EMBO J. 1993, 12:5083). The Myc-Max dimer is a transactivating complex which 
regulates the expression of various genes, including genes involved in cell proliferation, growth and apoptosis, as 
well as differentiation (Luscher B. Gene. 2001, 277:1; Holzel M, et al, EMBO Rep. 2001, 2:1 125; Ben-Porath I, 

15 Yanuka O, Benvenisty N. Mol Cell Biol. 1999,19:3529). Myc is overexpressed in a variety of cancers, including 
prostate, breast and colon cancer (Jenkins RB, Qian J, Lieber MM, Bostwick DG. Cancer Res. 1997, 57:524; 
Buttyan R, et al. Prostate. 1987;1 1:327; Chrzan P, et al. Clin Biochem. 2001, 34:557; Hashimoto K et al, 
Carcinogenesis. 2001, 22:1965). The steroid hormone receptor signature is a fingerprint with similarity to the zinc 
finger motif. It is often found in transcription factors, where it regulates DN A-protein and protein-protein 

20 interactions by determining the specificity of interacting partners (Green S et al, EMBO J. 1988,7:3037; Ribeiro 
RC, Kushner PJ, Baxter JD. Armu Rev Med. 1995;46:443). 

The presence of leucine zipper and protein-protein interaction domains along with its localization to the 
nucleus indicate that 12 1 PI Fl plays a role in regulating gene transcription in mammalian cells, and thereby 
regulates cellular proliferation, transformation, differentiation and apoptosis. These biological functions have a 

25 direct effect on transformation, tumor growth and progression. 

Accordingly, when 121P1F1 functions as a regulator of cell transformation, tumor formation, or as a 
modulator of transcription involved in activating genes associated with inflammation, tumorigenesis or 
proliferation, 121P1F1 is useful for therapeutic, diagnostic, prognostic and/or preventative purposes. In addition, 
when a molecule, such as a variant or SNP of 121 PI Fl, is expressed in cancerous tissues, such as those listed in 

30 Table I, it is useful for therapeutic, diagnostic, prognostic and/or preventative purposes. 

Several variants of 121P1F1 have been identified, including the 5 variants shown in Figure 10 and Figure 
1 1. Several of the variants (e.g. VIA, V2, V3 and V4) contain portions of 121P1F1 while lacking others. Other 
variants contain additional sequences not found in 121 PI Fl (e.g. VIA, V2 and V3). For example, variant 1 A is 
identical to 121 PI Fl in its first 92 aa, while lacking aa 93-205 of 121P1F1 and diverging from 121PlFlin itsC- 

35 terminal 34 aa (Figure 4A and Figure 4B). Variants 1 B, 3 and 4 contain a Myc-like leucine zipper, indicating that 
they bind DNA and function as transcription factors in a manner similar to full length 121 P1F1 . Properties of 
121P1F1 and splice variants 1 A and 4 are shown in Table XXI. 



Example 44: Regulation of Transcription 

40 The nuclear localization of 121 PI Fl coupled to the presence of bZIP and protein interaction domains 

within its sequence indicate that 121P1F1 is a transcription factor and modulates the transcriptional regulation of 
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eukaryotic genes. This function is supported by published reports, which show that Myc regulates the expression of 
multiple genes including Tmp, a gene that promotes transformation (Ben-Porath I, Yanuka O, Benvenisty N. Mol 
Cell Biol. 1999,19:3529), and p21 WAF1, a gene that controls the cell cycle (Mitchell KO and El-Deiry WS, Cell 
Growth Differ 1999, 10:223). Regulation of gene expression is confirmed, e.g., by studying gene expression in 
5 cells expressing or lacking 1 2 1 P 1 F 1 . For this purpose, two types of experiments are performed. 

In the first set of experiments, RNA from parental and 121PlFl-expressing cells are extracted and 
hybridized to commercially available gene arrays (Clontech) (Smid-Koopman E et al. Br J Cancer. 2000. 83:246). 
Resting cells as well as cells treated with FBS, androgen or growth factors are compared. Differentially expressed 
genes are identified in accordance with procedures known in the art. The differentially expressed genes are then 
1 0 mapped to biological pathways (Chen K et al. Thyroid. 200 1 . 11:41.). 

In the second set of experiments, specific transcriptional pathway activation is evaluated using 
commercially available (Stratagene) luciferase reporter constructs including: NFkB-luc, SRE-luc, ELKl-luc, ARE- 
luc, p53-luc, and CRE-luc. In addition, a Myc/Max specific response element, namely E-box hexamer CACGTG 
reporter is also evaluated (Ben-Porath I et al, Mol Cell Biol 1999;1 9:3529). These transcriptional reporters contain 
1 5 consensus binding sites for known transcription factors that lie downstream of well-characterized signal 

transduction pathways, and represent a good tool to ascertain pathway activation and screen for positive and 
negative modulators of pathway activation. 

Thus, 121P1F1 plays a role in gene regulation, and it is used as a target for diagnostic, prognostic, 
preventative and/or therapeutic purposes. 



Example 45: Identification and Confirmation of Potential Signal Transduction Pathways 
Many mammalian proteins have been reported to interact with signaling molecules and to participate in 
regulating signaling pathways. (J Neurochem. 2001 ; 76:217-223). Based on their ability to mediate protein 
interactions, leucine zipper proteins have been reported to regulate signaling pathways important for cell survival 

25 and growth (Nagamura-Inoue T et al, Int Rev Immunol. 2001, 20:83). Using immunoprecipitation and Western 
blotting techniques, proteins are identified that associate with 121PIF1 and mediate signaling events. Several 
pathways known to play a role in cancer biology can be regulated by 121 PI Fl, including phospholipid pathways 
such as PI3K, AKT, etc, adhesion and migration pathways, including FAK, Rho, Rac-1, etc, as well as 
mitogenic/survival cascades such as ERK, p38, etc (Cell Growth Differ. 2000,1 1 :279; J Biol Chem. 1999, 

30 274:801; Oncogene. 2000, 19:3003, J. Cell Biol. 1997,138:913.). 

To confirm that 121 PI Fl directly or indirectly activates known signal transduction pathways in cells, 
luciferase (luc) based transcriptional reporter assays are carried out in cells expressing individual genes. These 
transcriptional reporters contain consensus-binding sites for known transcription factors that lie downstream of 
well-characterized signal transduction pathways. The reporters and examples of these associated transcription 

35 factors, signal transduction pathways, and activation stimuli are listed below. 



20 



40 



4. 



2. 



6. 



3. 



5. 



1. 



NFkB-luc, NFkB/Rel; Ik-kinase/SAPK; growth/apoptos is/stress 

SRE-luc, SRF/TCF/ELK1; MAPK/SAPK; growth/differentiation 

AP-l-luc, FOS/JLTN; MAPK/SAPK/PKC; growth/apoptosis/stress 

ARE-luc, androgen receptor; steroids/MAPK; growth/differentiation/apoptosis 

p53-luc, p53; SAPK; growth/differentiation/apoptosis 

CRE-luc, CREB/ATF2; PKA/p38; growth/apoptosis/stress 
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Gene-mediated effects can be assayed in cells showing mRNA expression. Luciferase reporter plasmids 
can be introduced by lipid-mediated transfection (TFX-50, Promega). Luciferase activity, an indicator of relative 
transcriptional activity, is measured by incubation of cell extracts with luciferin substrate and luminescence of the 
reaction is monitored in a luminometer. 

Signaling pathways activated by 12 1 PI Fl are mapped and used for the identification and validation of 
therapeutic targets. When 12 1P1F1 is involved in cell signaling, it is used as a target for diagnostic, prognostic, 
preventative and/or therapeutic purposes. 

Example 46: Involvement in Tumor Progression 

Based on the documented role of bZip and Steroid hormone receptor motifs in cell growth and 
proliferation (Holzel M et al, EMBO Rep. 2001, 2:1 125), the 121 PI Fl gene can contribute to the growth of cancer 
cells. The role of 121 PI Fl in tumor growth is confirmed in a variety of primary and transfected cell lines including 
prostate, breast and kidney cell lines, as well as NIH 3T3 cells engineered to stably express 121P1F1. Parental cells 
lacking 121P1F1 and cells expressing 121P1F1 are evaluated for cell growth using a well-documented proliferation 
assay (Fraser SP, Grimes J A, Djamgoz MB. Prostate. 2000;44:61, Johnson DE, Ochieng J, Evans SL. Anticancer 
Drugs. 1996,7:288). 

To confirm the role of 121 PI Fl in the transformation process, its effect in colony forming assays is 
investigated. Parental NIH-3T3 cells lacking 121P1F1 are compared to NIH-3T3 cells expressing 121P1F1, using a 
soft agar assay under stringent and more permissive conditions (Song Z. et al. Cancer Res. 2000;60:6730). 

To confirm the role of 121 PI Fl in invasion and metastasis of cancer cells, a well-established assay is used, 
e.g., a Transwell Insert System assay (Becton Dickinson) (Cancer Res. 1999; 59:6010). Control cells, including 
prostate, breast and kidney cell lines lacking 121P1F1 are compared to cells expressing 121P1FI. Cells are loaded 
with the fluorescent dye, calcein, and plated in the top well of the Transwell insert coated with a basement 
membrane analog. Invasion is determined by fluorescence of cells in the lower chamber relative to the fluorescence 
of the entire cell population. 

121P1F1 can also play a role in the regulation of the cell cycle and apoptosis. Parental cells and cells 
expressing 121 PI Fl are compared for differences in cell cycle regulation using a well-established BrdU assay 
(Abdel-Malek ZA. J Cell Physiol. 1988, 136:247). In short, cells are grown under both optimal (full serum) and 
limiting (low serum) conditions, and are labeled with BrdU and stained with anti-BrdU Ab and propidium iodide. 
Cells are analyzed for entry into the Gl, S, and G2M phases of the cell cycle. Alternatively, the effect of stress on 
apoptosis is evaluated in control parental cells and cells expressing 121P1F1, including normal and tumor prostate, 
colon and lung cells. Engineered and parental cells are treated with various chemotherapeutic agents, such as 
etoposide, flutamide, etc, and protein synthesis inhibitors, such as cycloheximide. Cells are stained with annexin V- 
FITC and cell death is measured by FACS analysis. The modulation of cell death by 121 PI Fl can play a critical 
role in regulating tumor progression and tumor load. 

When 121P1F1 plays a role in cell growth, transformation, invasion or apoptosis, it is used as a target for 
diagnostic, prognostic, preventative and/or therapeutic purposes. 

E xample 47: Involvement in Angjogenesis 

Angiogenesis or new capillary blood vessel formation is necessary for tumor growth (Hanahan D, Folkman 
J. Cell. 1996,86:353; Folkman J. Endocrinology. 1998 139:441). Based on the effect of phsophodieseterase 
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inhibitors on endothelial cells, 121P1F1 plays a role in angiogenesis (DeFouw L et al, Microvasc Res 2001, 
62:263). Several assays have been developed to measure angiogenesis in vitro and in vivo, such as the tissue culture 
assays based on endothelial cell tube formation and endothelial cell proliferation. Using these assays as well as in 
vitro neo- vascularization, the role of 121 PI Fl in angiogenesis, enhancement or inhibition, is confirmed. 
5 - For example, endothelial cells engineered to express 121P1F1 are evaluated using tube formation and 

proliferation assays. The effect of 121 PI Fl is also confirmed in animal models in vivo. For example, cells either 
expressing or lacking 12 1 PI Fl are implanted subcutaneously in immunocompromised mice. Endothelial cell 
migration and angiogenesis are evaluated 5-15 days later using immunohistochemistry techniques. Demonstration 
of an effect of 121 PI Fl on angiogenesis confirms its usefulness as a target for diagnostic, prognostic, preventative 
1 0 and/or therapeutic purposes 

Example 48: Involvement in Protein-Protein Interactions 

Protein containing bZip motifs have been shown to interact with other proteins, specially proteins 
containing helix-loop-helix structures, thereby regulating gene transcription as well as cell growth (Schneider A et 

15 al , Curr Top Microbiol Immunol. 1997,224:137; Amati B, Land H. Curr Opin Genet Dev. 1994, 4:102). Using 
immunoprecipitation techniques as well as two yeast hybrid systems, proteins are identified that associate with 
121P1FL Immunoprecipitates from cells expressing 121P1F1 and cells lacking 121P1F1 are compared for specific 
protein-protein associations. 

Studies are performed to confirm the extent of association of 121 PI Fl with effector molecules, such as 

20 nuclear proteins, transcription factors, kinases, phsophates etc. Studies comparing 1 2 1 P 1 F 1 positive and 1 2 1 P 1 F 1 
negative cells as well as studies comparing unstimulated/resting cells and cells treated with epithelial cell activators, 
such as cytokines, growth factors, androgen and anti-integrin Ab reveal unique interactions. 

In addition, protein-protein interactions are confirmed using two yeast hybrid methodology (Curr Opin 
Chem Biol. 1999, 3:64). A vector carrying a library of proteins fused to the activation domain of a transcription 

25 factor is introduced into yeast expressing a 121 PI Fl-DNA-binding domain fusion protein and a reporter construct. 
Protein-protein interaction is detected by colorimetric reporter activity. Specific association with effector molecules 
and transcription factors directs one of skill to the mode of action of 121 PI Fl, and thus identifies therapeutic, 
prognostic, preventative and/or diagnostic targets for cancer. This and similar assays are also used to identify and 
screen for small molecules that interact with 121P1F1. 

30 Thus it is found that 121P1F1 associates with proteins and small molecules. Accordingly, 121PlFland 

these proteins and small molecules are used for diagnostic, prognostic, preventative and/or therapeutic purposes. 

Example 49: Involvement in DNA-Protein Interactions 

As previously mentioned, the basic-leucine zipper (bZIP) motif contain a basic region that mediates 
35 sequence-specific DNA-protein binding, as well as a leucine zipper region needed for protein dimerization. 

Electrophoretic mobility shift assays (EMS A) and DNA footprinting are used to identify 121PlFl-binding DNA 
sequences, and define specific response elements. In short, nuclear lysates are extracted from parental 121 PI Fl- 
negative as well as 121PlFl-expressing cells. The lysates are incubated in the presence of 32P-labeled DNA 
probes. DNA-protein complexes are either separated by electrophoresis or exposed to a restriction nuclease, and 
40 analyzed by radiography. This process provides 1 2 1 P 1 F 1 specific DNA elements that are valuable tools in 
designing and testing inhibitors of 1 2 IP 1 F 1 . 
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When 121P1F1 functions as a transcription factor, it is used as a target for diagnostic, prognostic, 
preventative and therapeutic purposes. 

5 Throughout this application, various website data content, publications, patent applications and patents are 

referenced. (Websites are referenced by their Uniform Resource Locator, or URL, addresses on the World Wide 
Web.) The disclosures of each of these references are hereby incorporated by reference herein in their entireties. 

The present invention is not to be limited in scope by the embodiments disclosed herein, which are 
intended as single illustrations of individual aspects of the invention, and any that are functionally equivalent are 
10 within the scope of the invention. Various modifications to the models and methods of the invention, in addition to 
those described herein, will become apparent to those skilled in the art from the foregoing description and 
teachings, and are similarly intended to fall within the scope of the invention. Such modifications or other 
embodiments can be practiced without departing from the true scope and spirit of the invention. 
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TABLES 



TABLE I: Tissues that Express 121P1F1 When Malignant 

Prostate 

Bladder 

Kidney 

Colon 

Lung 

Pancreas 

Breast 

Cervix 

Stomach 

TABLE II: AMINO ACID ABBREVIATIONS 



SINGLE LETTER 


THREE LETTER 


FULL NAME 


F 


Phe 


phenylalanine 


L 


Leu 


leucine 


S 


Ser 


serine 


Y 


Tyr 


tyrosine 


C 


Cys 


cysteine 


W 


Tip 


tryptophan 


P 


Pro 


proline 


H 


His 


histidine 


Q 


Gin 


glutamine 


R 


Arg 


arginine 


I 


He 


isoleucine 


M 


Met 


methionine 


T 


Thr 


threonine 


N 


Asn 


asparagine 


K 


Lys 


lysine 


V 


Val 


valine 


A 


Ala 


alanine 


D 


Asp 


aspartic acid 


E 


Glu 


glutamic acid 


G 


Gly 


glycine 
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5 

TABLE III: AMINO ACID SUBSTITUTION MATRIX 

Adapted from the GCG Software 9.0 BLOSUM62 amino acid substitution matrix (block substitution 
matrix). The higher the value, the more likely a substitution is found in related, natural proteins. (See URL located 
on the World Wide Web at www.ikp.unibe.ch/manual/blosum62.html ) 

10 



15 



20 



25 



30 



A 


C 


D 


E 


F 


G 


H 


I 


K 


L 


M 


N 


P 


Q 


R 


S 


T 


V 


W 


Y 


. 


4 


0 


-2 


-1 


-2 


0 


-2 


-1 


-1 


-1 


-1 


-2 


-1 


-1 


-1 


1 


0 


0 


-3 


-2 


A 




9 


-3 


-4 


-2 


-3 


-3 


-1 


-3 


-1 


-1 


-3 


-3 


-3 


-3 


-1 


-1 


-1 


-2 


-2 


C 






6 


2 


-3 


-1 


-1 


-3 


-1 


-4 


-3 


1 


-1 


0 


-2 


0 


-1 


-3 


-4 


-3 


D 








5 


-3 


-2 


0 


-3 


1 


-3 


-2 


0 


-1 


2 


0 


0 


-1 


-2 


-3 


-2 


E 










6 


-3 


-1 


0 


-3 


0 


0 


-3 


-4 


-3 


-3 


-2 


-2 


-1 


1 


3 


F 












6 


-2 


-4 


-2 


-4 


-3 


0 


-2 


-2 


-2 


0 


-2 


-3 


-2 


-3 


G 














8 


-3 


-1 


-3 


-2 


1 


-2 


0 


0 


-1 


-2 


-3 


-2 


2 


H 
















4 


-3 


2 


1 


-3 


-3 


-3 


-3 


-2 


-1 


3 


-3 


-1 


I 


















5 


-2 


-1 


0 


-1 


1 


2 


0 


-1 


-2 


-3 


-2 


K 




















4 


2 


-3 


-3 


-2 


-2 


-2 


-1 


1 


-2 


-1 


L 






















5 


-2 


-2 


0 


-1 


-1 


-1 


1 


-1 


-1 


M 
























6 


-2 


0 


0 


1 


0 


-3 


-4 


-2 


N 


























7 


-1 


-2 


-1 


-1 


-2 


-4 


-3 


P 




























5 


1 


0 


-1 


-2 


-2 


-1 


Q 






























5 


-1 


-1 


-3 


-3 


-2 


R 
































4 


1 


-2 


-3 


-2 


S 


































5 


0 


-2 


-2 


T 




































4 


-3 


-1 


V 






































11 


2 


W 








































7 


Y 
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TABLE IV (A) 



SUPERMOTIFS 


POSITION 


POSITION 


r»/~\0 ITI AVI 

POSITION 




2 (Primary Anchor) 


3 (Primary Anchor) 


C Terminus (Primary 
Anchor) 


Al 


TIL VMS 




FWY 


A2 


LIVlVL4r<9 




1\7 A A A TV 

W MAI L 


A3 


VSMA7X/ 




KK 


A24 


YFW^/KZ.A/r 




pi VJI/r A A 


B7 


P 




%711 T7AA1J/VA 

VILr MW i A 


B27 


RHK 




PV I 11/ 1 A1\/ A 

rYL W Ml VA 


B44 


ED 




FWYL11V1VA 


B58 


ATS 




T?\\f\/ I II/kAA 

FWY Li VMA 


B62 


QLIVMP 




FWYMIVLA 











MOTIFS 








Al 


TSM 




Y 


Al 




DE AS 


Y 


A2.1 


LM VQIA T 




\LIMA T 


A3 


LMVISATFCGD 




\S -\r r% J j TP a 


All 


VTMLISAGNCD/ 7 




tsJiiri 


A24 


YFWM 




FLIW 


A ♦'2 1 A. 1 


IV1V 




RK 


A*3301 


MVALF/57 




RK 


A*6801 


AVTA/SX/ 




RK 


B*0702 


P 




luMFWYAIV 


B*3501 


P 




LMFWY/M 


B51 


P 




IAVFWYAM 


B*5301 


P 




IMFW ALV 


B*5401 


P 




ATWLMFWY 



Bolded residues are preferred, italicized residues are less preferred: A peptide is considered motif-bearing if it has 
primary anchors at each primary anchor position for a motif or supermotif as specified in the above table. 



TABLE IV (B): HLA CLASS II SUPERMOTIF 



1 


6 


9 








W, F, Y, V, .1, L 


A, V, I, L, P, C, S, T 


A, V, I, L, C, S, T, M, Y 
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TABLE V (A) 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - Al, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RFSIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 




1 


169 


WTDNIFAIK 


50.000 


4^Portio 
n oi 


2 


114 


RCETEERTR 


9.000 I 


3 


16 


MMEIFSETK 


9.000 


SEOID 

NO: 3; 


4 


195 


FGIPEDFDY 


6.250 


each 
start 
position 


5 


106 


SIEKAKIGR 


4.500 


6 


20 


FSETKDVFQ 


2.700 


7 


59 


MVDCERIGT 


2.500 


is 


8 


185 


GFEENKIDR 


2.250 


specified 

, the 
enath of 


9 


116 


ETEERTRLA 


2.250 


10 


152 


VEEIRQANK 


1.800 


each 
peotide 


11 


101 


ASLQKSIEK 


1.500 


12 


93 


LSEGSQKHA 


1.350 


is 9 
amino 
acids, 
the end 


13 


54 


LVDDGMVDC 


1.000 


14 


146 


DCDPQVVEE 


1.000 


15 


85 


KLEVLESQL 


0.900 


position 


16 


151 


VVEEIRQAN 


0.900 


for each 


17 


8 


SAEEKRTRM 


0.900 


peptide 


18 


88 


VLESQLSEG 


0.900 


is the 
start 
position 


19 


130 


LRDQREQLK 


0.500 


20 


117 


TEERTRLAK 


0.450 


plus 
eight 


21 


193 


RTFGIPEDF 


0.250 


22 


66 


GTSNYYWAF 


0.250 


23 


77 


KALHARKHK 


0.200 


24 


72 


WAFPSKALH 


0.200 


25 


138 


KAEVEKYKD 


0.180 


26 


7 


LSAEEKRTR 


0.150 


27 


126 


ELSSLRDQR 


0.100 


28 


34 


KIAPKEKGI 


0.100 


29 


61 


DCERIGTSN 


0.090 


30 


133 


QREQLKAEV 


0.090 


31 


40 


KG1TAMSVK 


0.050 


32 


22 


ETKDVFQLK 


0.050 


33 


26 


VFQLKDLEK 


0.050 


34 


136 


QLKAEVEKY 


0.050 


35 


| 197 


1PEDFDYID 


0.045 


36 


I 47 


VKEVLQSLV 


0.045 


! 37 


162 


AKEAANRWT 


0.045 


38 


186 


FEENK1DRT 


0.045 


39 


91 


SQLSEGSQK 


0.030 


40 


63 


ERIGTSNYY 


0.025 


41 


42 


ITAMSVKEV 


0.025 


42 


5 


KGLSAEEKR 


0.025 


43 


144 


YKDCDPQVV 


0.025 




44 


148 


DPQVVEEIR 


0.025 




45 


124 


AKELSSLRD 


0.022 





114 



511582003440 



TABLE V (A) 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - Al, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


ID// 


46 


175 


AIKSWAKRK 


0.020 




47 


174 


FAIKSWAKR 


0.020 


48 


30 


KDLEKIAPK 


0.020 


49 


155 


IRQANKVAK 


0.020 


50 


160 


KVAKEAANR 


0.020 



TABLE VI (A) 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - Al, 10-MERS 


RANK 


CT ART 

POSITION 


IR9FOI IFNCF 

JUDODyUDli^/L 

RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


QFO 


1 


1 16 


ETEERTRLAK 


225.000 


S-kPortio 
n of SEO 
ID NO: 


2 


151 


VVEEIRQANK 


36.000 


3 


20 


FSETKDVFQL 


6.750 


3: each 


4 


169 


WTDNIFAIKS 


6.250 


start 
position 


5 


146 


DCDPQVVEEI 


5.000 




61 


DCERIGTSNY 


4.500 


is 

specified 


7 


3 1 


DLEKIAPKEK 


1.800 


g 


93 


LSEGSQKHAS 


1.350 , 


. the 
leneth of 


9 


25 


DVFQLKDLEK 


1.000 ] 


each 
peptide 


10 


100 


HASLQKSIEK 


1.000 


11 


29 


LKDLEKIAPK 


1.000 


is 10 
amino 


i 12 


8 


SAEEKRTRMM 


I 0.900 


13 


85 


KLEVLESQLS 


0.900 


acids, the 

end 
position 


14 


88 


VLESQLSEGS 


0.900 


15 


138 


KAEVEKYKDC 


0.900 


for each 


16 


114 


RCETEERTRL 


0.900 


peptide 


17 


105 J 


KSIEKAKIGR 


0.750 


is the 
start 


18 


72 


WAFPSKALHA 


0.500 


19 


59 


MVDCERIGTS 


0.500 


position 
plus nine 


20 


186 


FEENKIDRTF 


0.450 




21 


90 


ESQLSEGSQK 


! 0.300 


22 


55 


VDDGMVDCER 


0.250 


23 


172 


NIFAIKSWAK 


0.200 


24 


96 


GSQKHASLQK 


0.150 


25 


184 


FGFEENKIDR 


0.125 


26 


194 


TFGIPEDFDY 


0.125 


27 


130 


LRDQREQLKA 


0.125 


28 


18 


EIFSETKDVF 


0.100 


29 


6 


GLSAEEKRTR 


0.100 


30 


34 


KIAPKEKGIT 


0.100 




31 


15 


RMMEIFSETK 


0.100 




32 


68 


SNYYWAFPSK 


0.100 




33 


106 


SIEKAKIGRC 


0.090 





115 



511582003440 



TABLE VI (A) 



HLA PEPTIDE SCORING RESULTS - 121 PI Fl - Al, 10-MERS 





START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


C C/"\ 


RANK 


POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


}[)// 


34 


177 


KSWAKRKFGF 


0.075 




35 


67 


TSNYYWAFPS 


0.075 




36 


54 


LVDDGMVDCE 


0.050 




37 


185 


GFEENKIDRT 


0.045 




38 


124 


AKELSSLRDQ 


0.045 




39 


152 


VEEIRQANKV 


0.045 




40 


16 


MMEIFSETKD 


0.045 




41 


154 


EIRQANKVAK 


0.040 




42 


65 


IGTSNYYWAF 


0.025 




43 


42 


ITAMSVKEVL 


0.025 




44 


23 


TKDVFQLKDL 


0.025 




45 | 


190 


KIDRTFGIPE 


0.025 




46 


58 


GMVDCERIGT 


0.025 




47 


195 


FGIPEDFDY1 


0.025 




48 


44 


AMSVKEVLQS 


0.025 




49 


47 


VKEVLQSLVD 


0.022 




50 


174 


FAIKSWAKRK 


0.020 








TABLE VII (A) 








HLA PEPTIDE SCORING RESULTS - 12 IP 1F1 - A2, 9-MERS 




RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION , 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.1D// 


1 


15 


RMMEIFSET 


155.125 


W^Portio 


i 2 


122 


RLAKELSSL 


49.134 


n of SEO 
ID NO: 3; 


3 


196 


GIPEDFDYI 


30.116 


each start 


! 4 


78 


ALHARKHKL 


21.362 


position is 


5 


27 


FQLKDLEKI 


20.290 


soecified. 


6 


172 


NIFAIKSWA 


13.901 


the length 


7 


6 


GLSAEEKRT 


7.452 


of each 


! 8 


102 


SLQKSIEKA 


5.599 i 


peptide is 
9 amino 


9 


21 


SETKDVFQL 


5.541 


acids, the 


10 


34 


KIAPKEKGI 


5.021 


end 


11 


85 


KLEVLESQL 


4.785 


position 


12 


42 


ITAMSVKEV 


3.777 


for each 


13 


129 


SLRDQREQL 


3.262 


peptide is 
the start 


14 


54 


LVDDGMVDC 


2.787 


position 


15 


18 


EIFSETKDV 


2.654 


plus eieht 


16 


115 


CETEERTRL 


1.703 




17 


150 


QVVEEIRQA 


0.820 




18 


46 


SVKEVLQSL 


0.617 




19 


139 


AEVEKYKDC 


0.594 




20 


65 


IGTSNYYWA 


0.455 




21 


59 


MVDCERIGT 


0.443 





511582003440 



TABLE VII (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - A2, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.1D// 


22 


51 


LQSLVDDGM 


0.420 




23 


189 


NKIDRTFGI 


0.345 


24 


92 


QLSEGSQKH 


0.306 


25 


28 1 


QLKDLEKIA 


0.292 


26 


24 


KDVFQLKDL 


0.239 


27 


43 


TAMSVKEVL 


0.221 


28 


52 


OSLVDDGMV 


0.218 


29 


50 


VLQSLVDDG 


0.143 


30 


153 


EEIRQANKV 


0.101 


31 


70 


YYWAFPSKA 


0.100 


32 


168 


RWTDNIFAI 


0.079 


33 


177 


KSWAKRKFG 


0.078 


34 


144 


YKDCDPQVV 


0.073 


35 


165 


AANRWTDNI 


0.071 


36 


157 


QANKVAKEA 


0.069 


37 


64 


RJGTSNYYW 


0.056 


38 


186 


FEENKIDRT 


0.048 


39 


167 


NRWTDNIFA 


0.031 


40 


183 


KFGFEENKI 


0.025 


41 


99 


KHAoLQKol 




42 


53 


SLVDDGMVD 


0.025 


43 


88 


VLESQLSEG 


0.019 


44 


8 


SAEEKRTRM 


0.018 


45 


58 


GMVDCERIG 


0.018 


46 


72 


WAFPSKALH 


0.018 


47 


147 


CDPQVVEEI 


0.016 


48 


104 


OKSIEKAKI 


0.014 


49 


71 


YWAFPSKAL 


0.014 


1 50 


195 


FG1PEDFDY 


0.013 





TABLE VIII (A) 








HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 10-MERS 




RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


ID// 


1 


53 


SLVDDGMVDC 


46.848 


-14+rPortio 


i 2 


58 


GMVDCER1GT 


22.066 


nofSEO 
ID NO: 3; 


3 


41 


GITAMSVKEV 


21.996 


each start 


4 


92 


QLSEGSQKHA 


20.369 


position is 


5 


64 


RIGTSNYYWA 


5.636 


specified. 


6 


50 


VLQSLVDDGM 


4.138 


the length 


7 


77 


KALHARKHKL 


3.842 


of each 


8 


27 


FQLKDLEKIA 


3.515 


peptide is 
10 amino 


9 


17 


MEIFSETKDV 


2.299 


acids, the 



position 
for each 
peptide is 
the start 
Position 



511582003440 



TABLE VIII (A) 



HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 10-MERS 



< 

RANK 

] 


START ! 


SUBSEQUENCE J 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION * 




POSITION 1 


RESIDUE LISTING < 


DF A MOLECULE CONTAINING THIS SUBSEQUENCE) J 




10 


195 


FGIPEDFDYI 


1.604 




11 


51 


LQSLVDDGMV 


1 .558 




12 


72 


WAFPSKALHA 


1 1 "7 A 

1.174 




13 


46 1 


SVKEVLQSLV 


0.873 




14 


5 


KGLSAEEKRT 


a cy A 

0.630 




15 


20 


FSETKDVFQL 


A C AO 

0.548 




16 


45 


MSVKEVLQSL 


0.545 




17 


156 


RQANKVAKEA 


0.504 




18 


94 


SEGSQKHASL 


0.415 




19 


15 


RMMEIFSETK 


0.304 




20 


128 


SSLRDQREQL 


0.253 j 




21 I 


7 


LSAEEKRTRM 


0.226 




22 


34 


KIAPKEKGIT 


0.191 




23 


38 


KEKGITAMSV 


0.166 




24 j 


132 


DQREQLKAEV 


0.165 




25 


167 


NRWTDN1FAI 


0.160 




26 


152 


VEEIRQANKV 


0.147 




27 


101 


ASLQKSIEKA 


0.135 




28 


44 


AMSVKEVLQS 


0.124 




29 


35 


IAPKEKGITA 


0.1 17 




30 


70 


YYWAFPSKAL 


0.1 13 




! 31 


42 


ITAMSVKEVL 


0.101 




32 


79 


LHARKHKLEV 


0.082 




33 


177 


KSWAKRKFGF 


0.082 




34 


115 


CETEERTRLA 


0.079 




35 


103 


LQKSIEKAKI 


0.063 




36 


172 


NIFAIKSWAK 


0.057 




37 


182 


RKFGFEENKI 


0.054 




38 


157 


OANKVAKEAA 


0.034 




39 


91 


SQLSEGSQKH 


1 0.028 




40 


161 


VAKEAANRWT 


0.028 




41 


23 




0.027 




42 


150 


OVVEEIRQAN 


0.027 




43 


121 


TRLAKELSSL 


^ 0.025 




44 


142 


EKYKDCDPQV 


0.023 




45 


138 


KAEVEKYKDC 


0.023 




46 


160 


KVAKEAANRW 


0.023 




47 


87 


EVLESQLSEG 


0.017 




48 


85 


KLEVLESQLS 


j 0.017 




49 


84 


HKLEVLESQL 


1 0.015 




50 


102 


SLQKSIEKAK 


0.015 





118 



511582003440 



TABLE IX (A) 








HLA PEPTIDE SCORING RESULTS - 121P1F1 - A3, 9-MERS 




< 

RANK ' 
i 

J 


START 5 


SUBSEQUENCE I 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION { 








")F A MOLECULE CONTAINING THIS SUBSEQUENCE) \ 


m 


1 


16 


MMEIFSETK 


60.000 


01 Pnrtin 


2 


136 


OLKAEVEKY 


12.000 


n of SEO 
D NO: 3; 


3 


169 


WTDNIFA1K 


4.500 


each start 


4 


175 


AIKSWAKRK 


3.000 j 


position is 


5 


66 


GTSNYYWAF 


2.700 


specified. 


6 


85 


KLEVLESQL 


1.800 


the length 


7 ! 


22 


ETKDVFQLK 


1.350 


r»f each 


8 


97 


SQKHASLQK 


1.200 j 


peptide is 
9 amino 


9 


160 


KVAKEAANR 


1.200 


acids, the 


10 


126 


ELSSLRDQR | 


1.200 


end 


11 


193 


RTFGIPEDF 


1.125 


position 


12 


15 : 


RMMEIFSET 


1.012 


fr\r e^ar'Yx 
IUI Cddl 


13 


122 


RLAKELSSL 


0.900 


peptide is 
the start 


14 


91 


SQLSEGSQK 


0.900 


position 


15 


196 


GIPEDFDYI 


0.810 


plus eight 


16 


106 


SIEKAKIGR 


0.800 




17 


78 


ALHARKHKL 


0.600 




18 


129 


SLRDQREQL 


0.600 




19 


77 


KALHARKHK 


0.450 




1 20 


103 


LOKSIEKAK 


0.450 




21 


182 


RKFGFEENK 


0.450 




22 


102 


SLQKSIEKA 


0.300 




23 


92 


QLSEGSQKH 


L 0300 




24 


101 


ASLQKSIEK 


0.300 




25 


69 


NYYWAFPSK 


0.300 




26 


135 


EOLKAEVEK 


0.270 




27 


30 


KDLEKIAPK 


*~ 0.203 




28 


46 


SVKEVLQSL 


P 0.203 




29 


172 


NIFAIKSWA 


0.150 




30 


6 


GLSAEEKRT 


0.150 




31 


40 


KG1TAMSVK 


0.135 




32 


34 


KIAPKEKGI 


T 0.135 




33 


117 


TEERTRLAK 


0.120 




34 


28 


OLKDLEKIA 


0.100 




35 


4 


KKGLSAEEK 


0.060 




36 


173 


IFAIKSWAK 






37 


50 


VLOSLVDDG 


0.060 




38 


174 


FA1KSWAKR 


0.060 




39 


152 


VEEIRQANK 


0.060 




40 


64 


RIGTSNYYW 


0.060 




41 


123 


LAKELSSLR 


0.060 




42 


74 


FPSKALHAR 


0.060 




43 


53 


SLVDDGMVD 


i 0.060 




44 


27 


FQLKDLEKI 


j 0.041 




45 


26 


VFQLKDLEK 


0.040 





119 



511582003440 



TABLE IX (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - A3, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. 


46 


185 


GFEENKIDR 


0.036 




47 


54 


LVDDGMVDC 


0.030 


48 


32 


LEKIAPKEK 


0.030 


49 


88 


VLESQLSEG 


0.030 


50 


195 


FGIPEDFDY 


0.027 



TABLE X (A) 








HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 10-MERS 




< 

RANK 

] 


START I 
POSITION 


SUBSEQUENCE ! 
RESIDUE LISTING < 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION I 
Dr A MULhLULb L/VJIN 1 AlINlINvj Inlo oudocvuein^c; 1 


rD# 


1 


15 


RMMEIFSETK 


135.000 


^WPortio 

n nf <sPO 


2 


172 


NIFAIKSWAK 


30.000 


n or jgv 
D NO: 3; 


3 


129 


SLRDQREQLK 


20.000 


each start 


4 


136 


QLKAEVEKYK 


15.000 


position is 


5 


102 


SLQKSIEKAK 


15.000 


specified. 


6 


25 


DVFQLKDLEK 


6.000 


the length 


7 


122 


RLAKELSSLR 


4.000 


of each 

npntiHp i^ 


8 


31 


DLEKIAPKEK 


3.000 


10 amino 


9 


151 


VVEEIRQANK 


3.000 


acids, the 


10 


6 


GLSAEEKRTR 


1.200 


end 


11 


111 


KIGRCETEER 


1.200 
0.900 


position 
for each 


12 
13 


58 
116 


GMVDCERIGT 
ETEERTRLAK 


0.900 


r\ r> r\ 1 1 iH r> ic 

pcpiiuc lb 

the start 


14 


154 


EIRQANKVAK 


0.600 


position 


15 


96 


GSOKHASLQK 


1 0.600 


plus nine 


16 


68 


SNYYWAFPSK 


0.600 




17 


53 


SLVDDGMVDC 


0.450 




18 


174 


FAIKSWAKRK 


0.450 




19 


177 


KSWAKRKFGF 


0.450 




20 


100 


HASLQKSIEK 


[ 0.400 




21 


50 


VLQSLVDDGM 


0.300 




22 


18 


EIFSETKDVF 


0.300 




23 


105 


KSIEKAKIGR 


0.270 




24 


21 


SETKDVFQLK 


0.270 




25 


44 


AMSVKEVLQS 


0.240 




26 


74 


FPSKALHARK 


0.200 




27 


181 


KRKFGFEENK 


| 0.180 




28 


135 


EQLKAEVEKY 


0.162 




29 


92 


QLSEGSQKHA 


0.150 




30 


85 


KLEVLESQLS 


0.120 




31 


3 


KKKGLSAEEK 


0.090 




32 


168 


RWTDNIFAIK 


0.090 




33 


41 


GITAMSVKEV 


0.090 





120 



511582003440 



TABLE X (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - A3, 10-MERS 



RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 




POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


IDA 


34 


196 


GIPEDFDYID 


0.081 




35 [ 


184 


FGFEENKIDR 


0.060 




36 


134 


REOLKAEVEK 


0.060 




37 


64 


RIGTSNYYWA 


0.060 




38 


160 


KVAKEAANRW 


0.060 




39 


125 


KELSSLRDQR 


0.054 




40 


42 


ITAMSVKEVL 


0.045 




41 


28 


L» Iv U L. fc, K I A r 


0.040 




42 


88 


VLESQLSEGS 


0.040 




43 


190 


KIDRTFGIPE 


0.036 




44 


29 


LKDLEKIAPK 


0.030 




45 


46 


SVKEVLQSLV 


0.030 




46 


72 


WAFPSKALHA 


0.030 




47 


90 


ESQLSEGSQK 


0.030 




48 


77 


KALHARKHKL 


0.027 




49 




FSETKDVFQL 


0.027 




50 


165 


AANRWTDNIF 


0.020 





TABLE XI (A) 








HLA PEPTIDE SCORING RESULTS - 12 1 PI Fl -A11,9-MERS 




RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


SEQ. 


POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


\Dff 


1 


160 


KVAKEAANR 


1.200 


WkPortio 


2 


97 


SQKHASLQK 


1.200 


n ofSEO 
ID NO: 3; 


3 


169 


WTDNIFAIK 


1.000 


each start 


4 


91 


SQLSEGSQK 


0.900 


position is 


5 


69 


NYYWAFPSK 


0.800 


specified. 


6 


77 


KALHARKHK 


0.450 


the leneth 


7 


16 


MMEIFSETK 


0.400 


of each 
peptide is 


8 


173 


IFAIKSWAK 


0.400 


9 amino 


9 


26 


VFQLKDLEK 


0.400 


acids, the 


10 


103 


LQKSIEKAK 


0.300 


end 


11 


22 


ETKDVFQLK 


0.300 


position 


12 


135 


EQLKAEVEK 


0.270 


for each 


13 


185 


GFEENKIDR 


0.240 


peptide is 
the start 


14 


175 


AIKSWAKRK 


0.200 


position 


15 


106 


SIEKAKIGR 


0.160 


plus eifiht 


16 


182 


RKFGFEENK 


0.120 




17 


117 


TEERTRLAK 


0.120 




18 


40 


KGITAMSVK 


0.090 




19 


30 


KDLEKIAPK 


0.090 




20 


101 


ASLQKSIEK 


0.060 




21 


1 4 


KKGLSAEEK 


0.060 





121 
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TABLE XI (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - Al 1, 9-MERS 





START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING ( 


SCORE (ESTIMATE OF HALF TIME Or DISAbaUCIA 1 1UN 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


DA 


22 


152 


VEEIRQANK 


u.uou 




23 


174 


FAIKSWAKR 


a a^a 
U.UOU 




24 


66 


GTSNYYWAF 


A HAH 

u.uou 




25 


193 


RTFGIPEDF 


a aaa 
u.uou 




26 


123 


LAKELSSLR 


U.U4U 




27 


74 


FPSKALHAR 


U.U4U 




28 


32 


LEKIAPKEK 


A A'JA 
U.UJU 




29 


126 


ELSSLRDQR ! 


A Al/f 
U.UZ4 




30 


64 


RIGTSNYYW 


A AO/1 
U.UZ4 




31 


46 


SVKEVLQSL 


A AOA 

U.UZU 




32 


155 


IRQANKVAK 


A AOA 
U.UZU 




33 


130 


LRDQREQLK 


A AOA 
U.UZU 




34 


5 


KGLSAEEKR 


A Al 9 
U.U 1 5 




35 


114 


RCETEERTR 


A Al O 
U.U 1Z 




36 


148 


DPQVVEEIR 


A Al O 
U.U1Z 




37 


196 


GIPEDFDYI 


A A 1 O 
U.U IZ 




38 


1 85 


KLEVLESQL 


A Al O 
U.U 1 z 




39 


122 


RLAKELSSL 


U.U 1 z 




40 


143 


KYKDCDPQV 


A A 1 O 
U.U I Z 




A 1 


1 j / 


I KAFVFKYK 


0.010 




42 


27 


FQLKDLEKI 


0.009 




43 


172 


NIFAIKSWA 


0.008 




44 


70 


YYWAFPSKA 


0.008 




45 


34 


KIAPKEKGI 


0.006 




46 


51 


LQSLVDDGM 


0.006 




47 


13 


RTRMMEIFS 


0.006 




48 


183 


KFGFEENKI 


0.006 




49 


42 


ITAMSVKEV 


0.005 




50 


136 


OLKAEVEKY 


0.004 





TABLE XII (A) 

HLA PEPTIDE SCORING RESU LTS - 1 2 1 P 1 F 1 - A 1 1 , 1 0-M ERS 




RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF D15ASMJU1A i lON 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


ID// 


1 


15 


RMMEIFSETK 1 


2.400 


^kPorti 
on of 


2 


25 


DVFQLKDLEK 


2.400 


SEO ID 


3 


151 


VVEEIRQANK 


2.000 


NO: 3; 


4 


172 


NIFAIKSWAK 


^ " L600 


each start 


5 


116 


ETEERTRLAK 


0.600 


position 


6 


100 


HASLQKSIEK 


0.400 
0.400 


is 

specified 


7 
8 


129 
111 


SLRDQREQLK 
KIGRCETEER 


r 0.240 


. the 
leneth of 


9 


122 


RLAKELSSLR 


0.240 

122 


each 
peptide 
is 10 



amino 
acids, the 

end 
position 



511582003440 



TABLE XII (A) 



HLA PEPTIDE SCORING RESULTS - 121P1FI -All, 1 0-MERS 



< 
1 


START I 
POSITION 1 


SUBSEQUENCE I 
RESIDUE LISTING ( 


:>CORE (ESTIMATE Or HALr 1 1Mb Ur Ul^AooU^i/\i iwim t 
DF A MOLECULE CONTAINING THIS SUBSEQUENCE) 4 


: PQ 


10 


136 


QLKAbvbKYK. 






11 


102 i 


SLQKSIEKAK 


0 900 




12 


74 


rnoi/ At had \S 

FPSKALHARK 


0 900 




13 


134 


REQLKAEVLK 


0 1 RO 
U. 1 ou 




14 


174 


FAIKSWAKRK i 


O 1 SO 
U. 1 JU 




15 


96 


GSQKHASLQK 


O 1 90 
U. 1 ZU 




16 


154 


EIRQANKVAK 


O 190 




17 


68 


SNYYWAFPSK 


O ORO 
U.UoU 




18 


181 


KRKFGFEENK 


O OAO 
U.UOU 




19 


3 


KKKGLSAEEK 


O 060 




20 


168 


RWTDNIFAIK 


U.UOU 




21 


21 


SETKDVFQLK 


O OAO 
U.UOU 




22 


31 


DLEKIAPKEK 


O OAO 
U.UOU 




23 


160 


KVAKEAANRW 


O OAO 
U.UOU 




24 


125 


KELSSLRDQR 


U.UDH 




25 


73 


AFPSKALHAR 


O OAO 

u.u^u 




26 


173 


IFA1KSWAKR 


0 040 
U.UHU 




27 


105 


KSIEKAKIGR 


0 0^6 




28 


6 


GLSAEEKRTR 


0 094 




29 


64 


RIGTSNYYWA 


0 094 




30 


29 


LKDLEKIAPK 


0 090 
u.uzu 




31 


46 


SVKEVLQSLV 


O 090 
u.uzu 




32 


184 


FGFEENKIDR 


O 01 A 
U.U 1 0 




33 


4 


KKGLSAEEKR 


0 019 
U.U 1 z 




34 


143 


KYKDCDPQVV 


0 019 
U.U 1 z 




35 


42 


* m Am* /-> -m ¥ ¥ 7 I"" 1 1 FT 

ITAMSVKEVL 


0 010 
U.U 1 u 




36 


76 


SKALHARJCHK 


0 010 
U.U 1 u 




37 


156 


RQANKVAKEA 


0 000 




38 


77 


KALHARKHKL 


0 000 




39 


13 


RTRMMEIFSE 


0 000 
u.uuy 




40 


x. 91 


SQLSEGSQKH 


0 000 
u.uuy 




A 1 




NYYWAFPSKA 


0.008 




42 


72 


WAFPSKALHA 


0.008 




43 


159 


NKVAKEAANR 


; 0.006 




44 


39 


EKGITAMSVK 


0.006 




45 


114 


RCETEERTRL 


0.006 




46 


120 


RTRLAKELSS 


0.006 




47 


51 


LQSLVDDGMV 


Y 0006 




48 


90 


ESQLSEGSQK 


0.006 




49 


103 


LOKSIEKAKI 


0.006 




50 


193 


RTFGIPEDFD 


0.006 





123 
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TABLE XIII (A) 

HLA PEPTIDE SCORING RESULTS- 12 1 PI Fl - A24, 9-MERS 




< 

RANK.' 

I 

I 


START i 
J UM 1 1UN 1 


SUBSEQUENCE S 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION £ 
">F A MOI FCI II F CONTAINING THIS SUBSEOUENCE) J 


m 


1 


85 


KLEVLESQL 


14.400 4 


on of 
SEO ID 


2 


183 


KFGFEENKI 


13.200 


3 


143 


KYKDCDPQV 


12.000 


NO: 3; 


4 


19 


IFSETKDVF 


12.000 


;ach start 


5 


43 


TAMSVKEVL 


8.400 


position 


6 


46 


SVKEVLQSL 


8.064 

8.000 * 


is 

specified 


7 
8 


122 
193 


RLAKELSSL 
RTFGIPEDF 


5.600 


„ the 
encth of 


9 


70 


YYWAFPSKA 


5.500 


each 
peotide 


10 


129 


SLRDQREQL 


4.800 


11 


78 


ALHARKHKL 


4.400 


is 9 
amino 


12 


71 


YWAFPSKAL 


4.000 


13 


95 


EGSQKHASL 


4.000 


acids, the 

end 
position 


14 


166 


ANRWTDNIF 


2.400 


15 


34 


KIAPKEKGI 


2.400 


for each 


L 16 


168 


RWTDNIFAI 


2.400 


peotide 


17 


196 


GIPEDFDYI 


2.160 


is the 
start 


18 


178 


SWAKRKFGF 


2.000 


19 


66 


GTSNYYWAF 


2.000 


position 
plus 
eight 


20 


27 


FQLKDLEKI 


1.650 


21 


165 


AANRWTDNI 


1.500 


22 


57 


DGMVDCERI 


1.500 


23 


24 


KDVFQLKDL 


1.200 


24 


8 


SAEEKRTRM 


0.900 


25 


73 


AFPSKALHA 


0.750 


26 


51 


LQSLVDDGM 


0.700 


27 


15 


RMMEIFSET 


0.665 


28 


69 


NYYWAFPSK 


0.600 


29 


119 


ERTRLAKEL 


0.528 


30 


115 


CETEERTRL 


0.480 


31 


187 


EENKIDRTF 


0.420 


32 


12 


KRTRMME1F 


0.400 


! 33 


81 


ARKHKLEVL 


, 0.400 




34 


21 


SETKDVFQL 


0.400 




35 


151 


VVEEIRQAN 


0.302 




36 


99 


KHASLQKS1 


ft 940 
U.ZH-U 




37 


147 


CDPQVVEEI 


0.231 




38 


157 


OANKVAKEA 


0.231 




39 


176 


IKSWAKRKF 


0.220 




40 


109 


KAKIGRCET 


0.220 




41 


61 


DCERIGTSN 


, 0.210 




42 


13 


RTRMME1FS 


0.200 




43 


120 


RTRLAKELS 


0.200 




44 


64 


RIGTSNYYW 


0.200 




45 


189 


NKIDRTFGI 


0.180 





124 
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TABLE XIII (A) 



HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A24, 9-MERS 



RANK 


START 1 
POSITION 3 


SUBSEQUENCE ! 
RESIDUE LISTING < 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
DF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. 


46 


150 


OVVEEIRQA 


0.180 




47 


195 


FGIPEDFDY 


0.180 




48 


116 


ETEERTRLA 


0.180 




49 


102 


SLQKSIEKA 


0.165 




50 


171 


DNIFAIKSW 


0.150 








TABLE XIV (A) 








HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 10-MERS 




RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION . 
OF A MOLECULE CONTAINING THIS bUBbbOUbNCfcj 




1 


70 


YYWAFPSKAL 


200.000 


*54^Portio 
n of SEO 




143 


KYKDCDPQVV 


14.400 


2 


IDNO: 3; 


3 


77 


KALHARKHKL 


13.200 


each start 


4 


114 


RCETEERTRL 


12.000 


rin^itinn 1*5 

LIU31 11 XJl 1 IJ 


5 


45 


MSVKEVLQSL 


10.080 


specified. 


6 


26 


VFQLKDLEKI 


8.250 


the length 


7 


20 


FSETKDVFQL 


6.000 


of each 


8 


128 


SSLRDQREQL 


6.000 


peotide is 
10 amino 


9 


42 


ITAMSVKEVL 


5.600 


dCIUo. INC 


10 


69 


NYYWAFPSKA 


5.500 


end 


11 


80 


HARKHKLEVL 


4.000 


position 


12 


177 


KSWAKRKFGF 


4.000 


for each 


13 


165 


AANRWTDNIF 


3.600 


peptide is 
the start 


14 


175 


AIKSWAKRKF 


2.200 


position 


15 


195 


FGIPEDFDY I 


2.160 


plus nine 


16 


18 


EIFSETKDVF 


2.000 




17 


65 


IGTSNYYWAF 


2.000 




18 


146 


DCDPQVVEEI 


1.848 




19 


103 


LQKSIEKAKI 


1.100 




20 


50 


VLQSLVDDGM 


1.050 




21 


188 


ENKIDRTFGI 


1.000 




22 


164 


EA ANRWTDN I 


1.000 




23 


8 


SAEEKRTRMM 


0.900 




24 


185 


GFEENKIDRT 


0.900 




25 


84 


HKLEVLESQL 


| 0.864 




26 


121 


TRLAKELSSL 


0.600 




27 


36 


APKEKGITAM 


0.600 




28 


7 


LSAEEKRTRM 


| 0.600 




29 


118 


EERTRLAKEL 


0.528 




30 


194 


} TFGIPEDFDY 


0.500 




31 


! 186 


FEENK1DRTF 


0.420 




32 


23 


TKDVFQLKDL 


0.400 




33 


94 


SEGSQKHASL 


0.400 





125 
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TABLE XIV (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 10-MERS 



RANK. 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 




34 


85 


KLEVLESQLS 






35 


156 


RQANKVAKEA 


U.JUo 


36 


150 


QVVEEIRQAN 




37 


138 


KAEVEKYKDC 


U.JUU 


38 


5 


KGLSAEEKRT 


a taa 
U.3UU 


39 


192 


DRTFGIPEDF 


n TQA 

U.28U 


40 


182 


RKFGFEENKI 




A 1 


JH 




0.240 


42 


160 


KVAKEAANRW 


0.240 


43 


171 


DNIFAIKSWA 


0.210 


44 


64 


RIGTSNYYWA 


0.200 


45 


11 


EKRTRMMEIF 


0.200 


46 


120 


RTRLAKELSS 


0.200 


47 


27 


FQLKDLEKIA 


0.180 


48 


88 


VLESQLSEGS 


0.180 


49 


58 


GMVDCER1GT 


0.180 


50 


53 


SLVDDGMVDC 


0.180 



TABLE XV (A) 








HLA PEPTIDE SCORING RESULTS - 121 P1F1 - B7, 9-MERS 




RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOC1A 1 1UN 




POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


IP// 


1 


129 


SLRDQREQL 


60.000 


§04rPortio 


2 


43 


TAMSVKEVL 


36.000 


n of SEO 
IDNO: 3; 


3 


46 


SVKEVLQSL 


20.000 


each start 


4 


78 


ALHARKHKL 


12.000 


position is 


5 


36 


APKEKGITA 


6.000 


specified. 


6 


80 


HARKHKLEV 


6.000 


the length 


7 


122 


RLAKELSSL 


4.000 


of each 


8 


95 


EGSQKHASL 


4.000 ! 


peptide is 
9 amino 


9 


165 


AANRWTDNI 


L_ 3 * 60 ° 


acids, the 


10 


8 


SAEEKRTRM 


1.350 


end 


11 


85 


KLEVLESQL 


1.200 


position 


12 


81 


ARKHKLEVL 


1.200 


for each 


13 


57 


DGMVDCER1 


1.200 


peptide is 
the start 


14 


51 


LQSLVDDGM 


1.000 


position 


15 


154 


EIRQANKVA 


1.000 


plus eifcht 


16 


115 


CETEERTRL 


0.600 




17 


71 


YWAFPSKAL 


0.600 




18 


166 


ANRWTDN1F 


0.600 




19 


150 


QVVEEIRQA 


0.500 




20 


109 


KAKIGRCET 


0.450 




21 


27 


FQLKDLEK1 


0.400 





126 
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TABLE XV (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 9-MERS 



RANK. 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. 

IP// 


22 


11 


EKRTRMMEI 


0.400 




23 


21 


SETKDVFQL 


0.400 


24 


196 


GIPEDFDYI 


0.400 


25 


34 


KIAPKEKGI 


0.400 


26 ! 


119 


ERTRLAKEL 


0.400 


27 


24 


KDVFQLKDL 


0.400 


28 


35 


IAPKEKGIT 


0.300 


29 


15 


RMMEIFSET j 


0.300 


30 


158 


ANKVAKEAA ! 


0.300 


31 


L 157 


QANKVAKEA ! 


0.300 


32 


59 


MVDCERIGT 


0.225 


33 


148 


DPQVVEEIR 


0.200 


34 


18 


EIFSETKDV 


0.200 


35 


52 


QSLVDDGMV 


0.200 


36 


74 


FPSKALHAR 


0.200 


37 


120 


RTRLAKELS 


0.200 


38 


13 


RTRMMEIFS 


0.200 


39 


42 


ITAMSVKEV 


0.200 


40 


54 


LVDDGMVDC 


0.150 


41 


65 


1G I SNYYWA 


n i no 


42 


102 


SLQKSIEKA 


0.100 


43 


132 


DQREQLKAE 


0.100 


44 


1 


MSKKKGLSA 


0.100 


45 


112 


IGRCETEER 


0.100 


46 


6 


GLSAEEKRT 


^ 0.100 


47 


28 


OLKDLEKIA 


0.100 


48 


172 


NIFAIKSWA 


0.100 


49 


9 


AEEKRTRMM 


0.090 


50 


164 


EAANRWTDN 


0.060 



TABLE XVI (A) 

HLA PEPTIDE SCORING RESULTS - 121P1F1-B7, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


\\yff 


1 


80 


HARKHKLEVL 


120.000 


^^Portio 


2 


36 


APKEKGITAM 


60.000 


n ofSEO 
ID NO: 3: 


3 


77 


KALHARKHKL 


12.000 


each start 


4 


128 


SSLRDQREQL 


6.000 , 


position is 


5 


42 


ITAMSVKEVL 


4.000 


specified. 


6 


45 


MSVKEVLQSL 


4.000 


the leneth 


7 


118 


EERTRLAKEL 


4.000 


of each 


8 


166 


ANRWTDNIFA 


3.000 


peptide is 
10 amino 


9 


132 


DQREQLKAEV 


2.000 


acids, the 



1 27 e 

position 

for each 

peptide is 

the start 

position 



511582003440 



TABLE XVI (A) 



HLA PEPTIDE SCORING RESULTS - 12 IP 1F1 - B7, 10-MERS 



RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


^EQ 


POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 




10 


114 


RCETEERTRL 


1.800 




i 11 


7 


LSAEEKRTRM 


1.500 




12 


20 


FSETKDVFQL 


1.200 




13 


164 


EAANRWTDNI 


1.200 




14 


46 


SVKEVLQSLV 


1.000 




15 


50 


VLQSLVDDGM 


1.000 




16 


112 


IGRCETEERT 


1.000 




17 


8 


SAEEKRTRMM 


0.900 




18 


70 


YYWAFPSKAL 


0.600 




19 


94 J 


SEGSQKHASL 


0.400 




20 


188 


ENKIDRTFGI 


0.400 




21 


103 


LQKSIEKAKI 


0.400 




22 


121 


TRLAKELSSL 


0.400 




23 


195 


FGIPEDFDYI 


0.400 




24 


84 


HKLEVLESQL 


0.400 




25 


72 


WAFPSKALHA 


0.300 




26 


35 


IAPKEKGITA 


0.300 




27 


101 


ASLQKSIEKA 


0.300 




28 


157 


QANKVAKEAA 


0.300 




29 


161 


VAKEAANRWT 


0.300 




30 


120 


RTRLAKELSS 


0.200 




31 


41 


GITAMSVKEV 


0.200 




32 


148 


DPQVVEEIRQ 


0.200 




33 


51 


LQSLVDDGMV 


0.200 




34 


74 


FPSKALHARK 


0.200 




35 


165 


AANRWTDNIF 


0.180 




36 


58 


GMVDCERIGT 


0.150 




37 


150 


OVVEEIRQAN 


0.150 




38 


23 


TKDVFQLKDL 


0.120 




39 


146 


DCDPQVVEEI 


0.120 




40 


34 


KIAPKEKGIT 


0.100 




41 


27 


FQLKDLEKIA 


n inn 




42 


53 


SLVDDGMVDC 


0.100 




43 


13 


RTRMMEIFSE 


0.100 




44 


156 


RQANKVAKEA 


0.100 




45 


154 


EIRQANKVAK 


0.100 




46 


5 


KGLSAEEKRT 


0.100 




47 


92 


QLSEGSQKHA 


0.100 




48 


160 


KVAKEAANRW 


0.100 




49 


64 


RIGTSNYYWA 


0.100 




50 


129 


SLRDQREQLK 


0.100 





128 
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TABLE XVII (A) 








HLA PEPTIDE SCORING RESULTS -121P1F1-B35, 9-MERS 




RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


qpO 


POSITION 


DCCIHl TXT I 1CT rM f~l i 

tvbMDUb Lib 1 liNLi \ 






1 


36 


APKEKGITA 


12.000 


SQ+rPortio 
n oi bbU 
ID NO: 3; 


2 


136 


QLKAEVEKY 


9.000 


3 


161 


VAKEAANRW 


9.000 j 


each start 


4 


129 


SLRDQREQL 


6.000 


position is 


5 


46 


SVKEVLQSL 


6.000 


specified, 


6 


8 


SAEEKRTRM 


3.600 


the lenath 


7 


166 


ANRWTDNIF 


3.000 


of each 


8 


195 


FGIPEDFDY 


3.000 


peptide is 
9 amino 


9 


43 


TAMSVKEVL 


3.000 


acids, the 


10 


122 


RLAKELSSL 


3.000 


end 


11 


51 


LQSLVDDGM 


2.000 


oosition 


12 


193 


RTFGIPEDF 


2.000 


for each 


13 


80 


HARKHKLEV 


1.800 


peptide is 

the <;tart 


14 


109 


KAKIGRCET 


1.800 


Li i \^ oiai i 

position 


15 


52 


QSLVDDGMV 


1.500 


plus eight 


16 | 


1 


MSKJCKGLSA 


1.500 




17 


196 


GIPEDFDYI 


1.200 




18 


165 


AANRWTDNI 


1.200 




19 


66 


GTSNYYWAF 


1.000 




20 | 


78 


ALHARKHKL 


1.000 




21 


95 


EGSQKHASL 


1.000 




22 


64 


RIGTSNYYW 


1.000 




23 


34 


KIAPKEKGI 


0.800 




24 


45 


MSVKEVLQS 


0.750 




25 


57 


DGMVDCERI 


0.600 




26 


120 


RTRLAKELS 


0.600 




27 


13 


RTRMMEIFS 


0.600 




28 


28 


OLKDLEKIA 


0.600 




! 29 


27 


FQLKDLEKI 


0.600 




30 


85 


KLEVLESQL 


0.600 




31 


62 


CERIGTSNY 


0.600 




32 


171 


DNIFAIKSW 


0.500 




33 


35 


IAPKEKGIT 


0.450 




34 


15 


RMMEIFSET 


0.400 




35 


154 


EIRQANKVA 


0.300 




36 


157 


OANKVAKEA 


0.300 




37 


150 


QVVEEIRQA 


0.300 




38 


115 


CETEERTRL 


0.300 




39 


158 


ANKVAKEAA 


0.300 




40 


164 


EAANRWTDN 


j 0.300 




41 


81 


ARKHKLEVL 


0.300 




42 


18 


EIFSETKDV 


0.300 




43 


143 


KYKDCDPQV 


0.240 




44 


42 


ITAMSVKEV 


0.200 




I 45 


105 


KSIEKAKIG 


0.200 
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TABLE XVII (A) 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B35, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. 
IDA 


46 


74 


FPSKALHAR 


0.200 




47 


148 


DPQVVEEIR 


0.200 


48 


12 


KRTRMMEIF 


0.200 


49 


24 


KDVFQLKDL 


0.200 


50 


63 


ERIGTSNYY 


0.200 



TABLE XVIII (A) 








HLA PEPTIDE SCORING RESULTS - 121P1F1 -B35, 10-MERS 




RANK 


START 


SUBSEOUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


cpQ 

■J L_i\^ . 


POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 




1 


36 


APKEKGITAM 


240.000 


65-kPortio 
n of SEO 
ID NO: 3; 


2 


7 ; 


LSAEEKRTRM 


20.000 


3 


177 


KSWAKRKFGF 


10.000 


CdCIl Md.IL 


4 


80 


HARKHKLEVL 


9.000 


position is 


5 


77 


KALHARKHKL 


6.000 


specified. 


6 


45 


MSVKEVLQSL 


5.000 


the length 


7 


128 


SSLRDQREQL 


5.000 


of each 


s 


8 


SAEEKRTRMM 


3.600 


peptide is 
1 0 amino 


9 


175 


AIKSWAKRKF 


3.000 


acids, the 


10 


165 


AANRWTDNIF 


3.000 


end 


11 


135 


EQLKAEVEKY 


3.000 


Dosition 


12 


20 


FSETKDVFQL 


2.250 


for each 


13 


50 


VLQSLVDDGM 


2.000 


peptide is 
the start 


14 


161 


VAKEAANRWT 


1.800 


position 


15 


103 


LQKSIEKAKI 


1.800 


olus nine 


16 


132 


DQREQLKAEV 


1.200 




17 


188 


ENKIDRTFGI 


1.200 




18 


46 


SVKEVLQSLV 


1.200 




19 


164 


EAANRWTDNI 


! 1.200 




20 


65 


IGTSNYYWAF 


1.000 




21 


42 


ITAMSVKEVL 


[ 1.000 




22 


160 


KVAKEAANRW 


1.000 




23 


18 


EIFSETKDVF 


1.000 




24 


114 


RCETEERTRL 


0.900 




25 


120 


RTRLAKELSS 


0.600 




26 


62 


CER1GTSNYY 


0.600 




27 


61 


DCERIGTSNY 


0.600 




28 


195 


FGIPEDFDYI 


0.600 




29 


67 


TSNYYWAFPS 


0.500 




30 


101 


ASLQKSIEKA 


0.500 




31 


166 


1 ANRWTDNIFA 


0.450 




32 


143 


KYKDCDPQVV 


0.360 




33 


97 


SQKHASLQKS 


0.300 
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TABLE XVIII (A) 



HLA PEPTIDE SCORING RESULTS - 121P1F1 - B35, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


g£Q 


i 34 


58 


GMVDCERIGT 


0.300 




35 


5 


KGLSAEEKRT 


0.300 


36 


194 


TFGIPEDFDY 


0.300 


37 


34 


KJAPKEKGIT 


0.300 


38 


158 


ANKVAKEAAN 


0.300 


39 


148 


DPQVVEEIRQ 


0.300 


40 


11 


EKRTRMMEIF 


0.300 


41 


1 12 


lGRCb I hhRI 


a inn 


42 


35 


IAPKEKGITA 


0.300 


43 


118 


EERTRLAKEL 


0.300 


44 


157 


QANKVAKEAA 


0.300 


45 


72 


WAFPSKALHA 


0.300 


46 


51 


LQSLVDDGMV 


0.300 


47 


105 


KSIEKAKIGR 


0.200 


48 


64 


RIGTSNYYWA 


0.200 


49 


74 


FPSKALHARK 


0.200 


50 


150 


QVVEEIRQAN 


0.200 
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TABLE V (B): 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1FI- Al, 9-MERS 



RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOI FCUI F CONTAINING THIS SUBSEOUENCE) 


\Dff 


1 


98 


FHEIIKVSY 


4.500 


Portion of 


2 


88 


VLESQDPGC 


1.800 


opa in* 
obU ID 


3 


95 


GCCFHEIIK 


1.000 


NO: 5; 
each start 


4 


91 


SQDPGCCFH 


0.750 


position is 


5 


118 


ACNPSTLGG 


0.500 


specified. 


6 


90 


ESQDPGCCF 


0.150 


the length 


7 


85 


KLEVLESQD 


0.090 


of each 


8 


104 


VSYYRKFWL 


0.075 


peptide is 
9 amino 
acids, the 


9 


96 


CCFHEIIKV 


0.050 


10 


101 


IIKVSYYRK 


0.040 


end 
oosition 


11 


99 


HEIIKVSYY 


0.025 


12 


115 


VAHACNPST 


0.020 


for each 


13 


100 


EIIKVSYYR 


0.020 


peotide is 
the start 
position 


14 


103 


KVSYYRKFW 


0.010 


15 


117 


HACNPSTLG 


0.010 


plus 
eightfQ+ 


16 


111 


WLGAVAHAC 


0.010 


17 


114 


AVAHACNPS 


0.010 


18 


87 


EVLESQDPG 


0.010 


19 


102 


1KVSYYRKF 


0.005 


20 


112 


LGAVAHACN 


0.005 


21 


93 


DPGCCFHE1 


0.003 


22 


108 


RKFWLGAVA 


| 0.001 


23 


110 


FWLGAVAHA 


0.001 


24 


113 


GAVAHACNP 


| 0.001 


25 


97 


CFHEIIKVS 


0.001 


26 


116 


AHACNPSTL 


0.001 


1 27 


89 


LESQDPGCC 


0.001 


28 


92 


QDPGCCFHE 


0.000 


29 


94 


PGCCFHEII 


0.000 


30 


110 


KFWLGAVAH 


0.000 


31 


105 


SYYRKFWLG 


0.000 


32 


86 


LEVLESQDP 


0.000 


33 


107 


YRKFWLGAV 


0.000 




34 


106 


YYRKFWLGA 


0.000 





TABLE VI (B) 

VARIANT IA HKLEVLESQDPGCCFHEI1KVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - Al, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. 
|[)// 


1 


98 


FHEIIKVSYY 


2.250 
0.900 


T^rPortio 
nofSEO 


2 
3 


88 
91 


VLESQDPGCC 
SQDPGCCFHE 


0.375 


ID NO: 5: 
each start 



position is 
specified, 
the length 
of each 
peptide is 
10 amino 
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TABLE VI (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - Al, IQ-MERS 



RANK. 


START ; 
rUM 1 1UN 


SUBSEQUENCE 

DCCIHI IC I 

tvtiolUUti Lrlo I IINvj 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


CCA 


4 


85 


KLEVLESQDP 


0.090 




5 


95 


GCCFHEIIKV 


0.050 


6 


117 


HACNPSTLGG 


0.050 


7 


97 


CFHEIIKVSY 


0.050 


8 


103 


KVSYYRKFWL 


0.050 


9 


100 


EIIKVSYYRK 


0.040 


10 


94 


PGCCFHEIIK 


0.025 


11 


111 


WLGAVAHACN j 


0.020 


12 


114 


AVAHACNPST 


0.020 


13 


87 


EVLESQDPGC 


0.020 


14 


90 


ESQDPGCCFH 


0.015 


15 


104 


VSYYRKFWLG 


0.015 


16 


113 


GAVAHACNPS 


0.010 


17 


99 


HEIIKVSYYR 


0.010 


18 


115 . 


VAHACNPSTL 


0.010 


| 19 


101 


IIKVSYYRKF 


0.010 


20 


96 


CCFHEIIKVS 


0.010 


21 


89 


LESQDPGCCF 


0.005 


22 


93 


DPGCCFHEII 


0.003 


23 


108 


RKFWLGAVAH 


0.001 


24 


92 


QDPGCCFHEI 


0.001 


25 


116 


AHACNPSTLG 


0.001 


26 


102 


IKVSYYRKFW 


0.001 


27 


110 


FWLGAVAHAC 


0.001 


28 


86 


LEVLESQDPG 


0.001 


29 


105 


SYYRKFWLGA 


0.000 


30 


112 


LGAVAHACNP 


0.000 


31 


109 


KFWLGAVAHA 


0.000 


32 


107 


YRKFWLGAVA 


0.000 




33 


84 


HKLEVLESQD 


0.000 




34 


106 


YYRKFWLGAV 


0.000 





TABLE VII (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 9-MERS 




RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


SEQr 


POSITION 


RESIDUE LISTING 


OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


IP// ! 


1 


104 


VSYYRKFWL 


24.199 


h769rPortio 


2 


111 


WLGAVAHAC 


22.853 


n of SEO 
ID NO: 5; 


3 


96 


CCFHEIIKV 


3.864 


each start 


4 


88 


VLESQDPGC 


0.541 


position is 


5 


115 


VAHACNPST 


0.176 


specified, 


6 


103 


| KVSYYRKFW 


0.126 


the length 


7 


110 


FWLGAVAHA 


0.027 

133 


of each 
peDtide is 
9 amino 



acids, the 

end 
position 
for each 



peptide is 
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TABLE VII (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - A2, 9-MERS 



RANK 


START 

rUol 1 1UIN 


SUBSEQUENCE 

Ivt/olL/U t L,lo 1 lINtj 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


CCA 


1 8 


89 


LESQDPGCC 


0.021 




9 


91 


SQDPGCCFH 


0.017 


10 


116 


AHACNPSTL 


0.015 


ii ! 


108 


RKFWLGAVA 


0.010 


12 


93 


DPGCCFHEI 


0.010 


! 13 


114 


AVAHACNPS 


0.007 


14 


87 


EVLESQDPG 


0.004 


15 


85 


KLEVLESQD 


0.003 


16 


106 


YYRKFWLGA 


0.002 


17 


109 


KFWLGAVAH 


0.002 


18 


94 


PGCCFHEII 


0.001 


19 


100 


EIIKVSYYR 


0.001 


20 


112 


LGAVAHACN 


0.001 


21 


99 


HEIIKVSYY 


0.001 


22 


86 


LEVLESQDP 


0.000 


23 


118 


ACNPSTLGG 


0.000 


24 


105 


SYYRKFWLG 


0.000 


25 


107 


YRKFWLGAV 


0.000 


26 


113 


GAVAHACNP 


0.000 


27 


97 


CFHEIIKVS 


0.000 


28 


101 


IIKVSYYRK 


0.000 




29 


90 


ESQDPGCCF 


0.000 


30 


92 


QDPGCCFHE 


0.000 




31 


102 


1KVSYYRKF 


0.000 




32 


95 


GCCFHEIIK 


0.000 




33 


117 


HACNPSTLG 


0.000 




34 


98 


FHEIIKVSY 


0.000 








TABLE VIII (B) 

VARIANT 1A: HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - A2, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE ! 
RESIDUE LISTING < 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
DF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


103 


KVSYYRKFWL 


208.697 


gO^Portion 


2 


95 


GCCFHEIIKV 


1.044 


of SEO ID 


3 


114 


AVAHACNPST 


0.652 


NO: 5: each 

start 
position is 


4 


115 


VAHACNPSTL 


0.504 


5 


87 


EVLESQDPGC 


0.495 


specified. 


6 


111 


WLGAVAHACN 


0.343 


the length 

of each 
peDtide is 
10 amino 
acids, the 


7 


109 


KFWLGAVAHA 


0.231 


8 


88 


VLESQDPGCC 


0.070 


9 


104 


VSYYRKFWLG 


0.038 


10 


92 


QDPGCCFHEI 


0.028 


end 
position for 

each 
oeotide is 
the start 


11 


105 


SYYRKFWLGA 


0.014 

134 



position 
plus nine 
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TABLE VIII (B) 

VARIANT 1A: HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


12 


110 


FWLGAVAHAC 


0.012 




13 


93 


DPGCCFHEII 


0.004 


14 


91 


SQDPGCCFHE 


0.004 


15 


85 


KLEVLESQDP 


0.003 


16 


89 


LESQDPGCCF 


0.002 


17 


96 


CCFHEIIKVS 


0.002 


18 


86 


LEVLESQDPG 


0.001 


19 


113 


GAVAHACNPS 


0.001 


20 


106 


YYRKFWLGAV 


0.001 


21 


102 


1KVSYYRKFW 


0.001 


22 


90 


ESQDPGCCFH 


0.001 


23 


108 


RKFWLGAVAH 


0.000 


24 


100 


EIIKVSYYRK 


0.000 


25 


97 


CFHEIIKVSY 


rv f\c\c\ 

0.000 


26 


98 


FHEIIKVSYY 


0.000 


27 


101 


IIKVSYYRKF 


0.000 


28 


112 


LGAVAHACNP 


0.000 


29 


99 


HEIIKVSYYR 


0.000 


30 


116 


AHACNPSTLG 


0.000 


31 


107 


YRKFWLGAVA 


0.000 


32 


117 


HACNPSTLGG 


0.000 


33 


84 


HKLEVLESQD 


0.000 


34 


94 


PGCCFHEIIK 


1 0.000 



TABLE IX (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


ccQ 


1 


101 


IIKVSYYRK 


6.000 


S^Portio 


2 


95 


GCCFHEIIK 


1.200 


n of SEO 
ID NO: 5; 


3 ! 


100 


EIIKVSYYR 


0.810 


each start 


4 


111 


WLGAVAHAC 


0.300 


position is 


5 


88 


VLESQDPGC 


0.200 


specified. 


6 


103 


KVSYYRKFW 


0.090 


the length 


7 


85 


KLEVLESQD 


0.060 


of each 


8 


99 


HEIIKVSYY 


0.054 


peptide is 
9 amino 
acids, the 


9 


104 


VSYYRKFWL 


0.045 


10 


96 


CCFHEIIKV 


0.030 


end 
position 


11 


91 


SQDPGCCFH 


0.009 


12 


98 


FHEIIKVSY 


0.006 


for each 
peptide is 


13 


93 


DPGCCFHEI 


0.005 


14 


90 


ESQDPGCCF 


0.005 


the start 
position 


15 


114 


AVAHACNPS 


0.004 


plus eifcht 



135 
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TABLE IX (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - A3, 9-MERS 



RANK 


START 


SUBSEQUENCE 
RF<iinnF i iSTrNin 

rVLZ/ 0 1 VJ \J \-j LIO 1 I IN VJ 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOI FPUI F CONTAINING THIS SI JBSFOl JFNCE^ 


SEQ 

ID// 


16 


109 


KFWLGAVAH 


0.003 




17 


87 


EVLESQDPG 


0.001 


18 


110 


FWLGAVAHA 


0.001 


19 


106 


YYRKFWLGA 


0.001 


20 


115 


VAHACNPST 


0.001 


21 


108 


RKFWLGAVA 


0.001 


22 


102 


IKVSYYRKF 


0.001 


23 


105 


SYYRKFWLG 


0.001 


24 


113 


GAVAHACNP 


0.001 


25 


118 


ACNPSTLGG 


0.001 


26 


116 


AHACNPSTL 


0.001 


27 


117 


HACNPSTLG 


0.000 


28 


107 


YRKFWLGAV 


0.000 


29 


94 


PGCCFHEII 


0.000 


30 


89 


LESQDPGCC 


0.000 


31 


92 


QDPGCCFHE 


0.000 


32 


86 


LEVLESQDP 


0.000 


33 


97 


CFHEIIKVS 


0.000 


34 


112 


LGAVAHACN 


0.000 



TABLE X (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAH ACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - A3, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


100 


EIIKVSYYRK 


2.700 


87+rPortion 


2 


103 


KVSYYRKFWL 


0.540 


of SEO ID 


3 


99 


HEIIKVSYYR 


0.081 


NO: 5: each 

start 
position is 


4 


88 


VLESQDPGCC 


0.060 


5 


85 


KLEVLESQDP 


0.060 


specified. 


6 


101 


IIKVSYYRKF 


0.060 


the length 
of each I 
peptide is 
10 amino 
acids, the 


7 


111 


WLGAVAHACN 


0.020 


8 


95 


GCCFHEIIKV 


0.018 


9 


88 


EVLESQDPGC 


0.013 


10 


98 


FHE1IKVSYY 


0.012 


end i 
position for 


11 


114 


AVAHACNPST 


0.010 


12 


97 


CFHEIIKVSY 


0.009 


each 

peptide is 
the start 
position 

dIus nine 


13 


109 


KFWLGAVAH A 


0.009 


14 


89 


LESQDPGCCF 


0.009 


15 


105 


SYYRKFWLG A 


0.006 


16 


115 


VAHACNPSTL 


0.006 




17 


93 


DPGCCFHEII 


0.005 


18 


104 


VSYYRKFWLG 


0.005 


19 


94 


PGCCFHEIIK 


0.004 



136 
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TABLE X (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 12 1P1F1 - A3, 10-MERS 



RANK 


START 

rUol i I^JIN 


SUBSEQUENCE 

RPQTmiF 1 l^TPNfi 
IvCollJUC Llo 1 liN VJ 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 

HF A MOI PPl II F PfYNTArMnsin TH1<s <\I IR^sFOl JFNCF'l 

Ur f\ IVi WL<II/V-' ULiC V^WIN 1 r\ 1 IN 1 i> VJ I IllO OUDOdyL/LiliVLi^ 


SEQ. ID// 


20 


91 


SQDPGCCFHE 


0.003 




21 


92 


QDPGCCFHEI 


0.003 


22 


96 


CCFHEIIKVS 


0.002 


23 


113 


GAVAHACNPS 


0.002 


24 


108 


RKFWLGAVAH 


0.001 


25 


110 


FWLGAVAHAC 


0.001 


26 


102 


IKVSYYRKFW 


0.000 


27 


117 


HACNPSTLGG 


0.000 


28 


90 


ESQDPGCCFH 


0.000 


29 


106 


YYRKFWLGAV 


0.000 


30 


107 


YRKFWLGAVA 


0.000 


31 


86 


LEVLESQDPG 


0.000 


32 


84 


HKLEVLESQD 


0.000 


33 


33 


AHACNPSTLG 


0.000 


34 


29 


LGAVAHACNP 


0.000 



TABLE XI (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - Al 1, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ 

ID// 


1 


95 1 


GCCFHEIIK 


1.200 


90£rPortio 
n of SEO 


2 


101 


IIKVSYYRK 


0.800 


3 


100 


EIIKVSYYR 


0.072 


ID NO: 5: 
each start 


4 


103 


KVSYYRKFW 


0.030 


position is 


5 


109 


KFWLGAVAH 


0.012 


specified. 


6 


96 


CCFHEIIKV 


0.008 


the length 


7 


106 


YYRKFWLGA 


0.008 


of each 


8 


91 


SQDPGCCFH 


0.006 


peptide is 
9 amino 
acids, the 


9 


114 


AVAHACNPS 


0.002 


10 


105 


SYYRKFWLG 


0.002 


end 
position 


11 


85 


KLEVLESQD 


0.001 


12 


104 


VSYYRKFWL 


0.001 


for each 
peptide is 


13 


108 


RKFWLGAVA 


0.001 


14 


87 


EVLESQDPG 


0.001 


the start 
position 
plus eight 


15 


113 


GAVAHACNP 


0.001 


16 


99 


HEIIKVSYY 


0.001 




17 


93 


DPGCCFHEI 


0.001 


18 


88 


VLESQDPGC 


0.000 


19 


118 


ACNPSTLGG 


0.000 


20 


111 


WLGAVAHAC 


0.000 


! 21 


110 


FWLGAVAHA 


0.000 


22 


116 


AHACNPSTL 


0.000 


23 


98 


FHEI1KVSY 


0.000 



137 
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TABLE XI (B) ! 
VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HL A PEPTIDE SCORING RESULTS -121P1F1-A11, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEOUENCE) 


SEQ 

IP// 


24 


115 


VAHACNPST 


0.000 




25 


117 


HACNPSTLG 


0.000 


26 


107 


YRKFWLGAV 


0.000 


27 


97 


CFHEIIKVS 


0.000 


28 


86 


LEVLESQDP 


0.000 


29 


92 


QDPGCCFHE 


0.000 


30 


89 


LESQDPGCC 


0.000 


31 


90 


ESQDPGCCF 


0.000 


32 


102 


IKVSYYRKF 


0.000 


33 


112 


LGAVAHACN 


0.000 


34 


94 


PGCCFHEII 


0.000 



TABLE XII (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121 P1F1 - Al 1, 10-MERS 




START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


IP// 


1 


100 


EIIKVSYYRK 


0.360 


9^Portio 
n of SEO 


2 


103 


KVSYYRKFWL 


0.180 


3 


99 


HEIIKVSYYR 


0.036 


ID NO: 5: 
each start 


4 


105 


SYYRKFWLGA 


0.016 


position is 


5 


95 


GCCFHEIIKV 


0.012 


specified. 


6 


109 


KFWLGAVAHA 


0.012 


the length 


7 


94 


PGCCFHEIIK 


0.004 


of each 
peptide is 
10 amino 


8 


106 


YYRKFWLGAV 


0.004 


9 


97 


CFHEIIKVSY 


0.002 


acids, the 


10 


114 


AVAHACNPST 


0.002 


end 
position 


11 


115 


VAHACNPSTL 


0.002 


12 


91 


SQDPGCCFHE 


0.002 


for each 
peptide is 
the start 
Dosition 
plus nine 


13 


85 


KLEVLESQDP 


0.001 


14 


108 


RKFWLGAVAH 


0.001 


15 


113 


GAVAHACNPS 


0.001 


16 


87 


EVLESQDPGC 


0.001 




17 


93 


DPGCCFHEII 


0.001 


18 


89 


LESQDPGCCF 


0.001 


19 


101 


IIKVSYYRKF 


0.000 


20 


111 


WLGAVAHACN 


0.000 


21 


117 


HACNPSTLGG 


0.000 


22 


88 


VLESQDPGCC 


0.000 


23 


98 


FHEIIKVSYY 


0.000 


24 


92 


QDPGCCFHEI 


0.000 


25 


96 


CCFHEIIKVS 


0.000 


26 


107 


YRKFWLGAV A 


0.000 


27 


102 


IKVSYYRKFW 


0.000 
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TABLE XII (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1-A11, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ 


28 


86 


LEVLESQDPG 


0.000 




29 


104 


VSYYRKFWLG 


0.000 


30 


90 


ESQDPGCCFH 


0.000 


31 


84 


HKLEVLESQD 


0.000 


32 


110 


FWLGAVAHAC 


0.000 


33 


112 


LGAVAHACNP 


0.000 


34 


116 


AHACNPSTLG 


0.000 



TABLE XIII (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 -A24, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORt (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


IP// 


1 


106 


YYRKFWLGA 


5.000 


^rPortio 
n of SEO 


2 


104 


VSYYRKFWL 


4.000 


3 


90 


ESQDPGCCF 


3.600 


ID NO: 5; 
each start 
position is 


4 


93 


DPGCCFHEI 


1.320 


5 


97 


CFHEIIKVS 


0.840 


specified. 


6 


105 


SYYRKFWLG 


0.600 


the length 


7 


116 


AHACNPSTL 


0.400 


of each 
peptide is 


8 


102 


IKVSYYRKF 


0.330 


9 


103 


KVSYYRKFW 


0.200 


9 amino 
acids, the 


10 


88 


VLESQDPGC 


0.150 


end 
position 


11 


110 


FWLGAVAHA 


0.150 


1 12 


111 


WLGAVAHAC 


0.140 


for each 
peptide is 
the start 
position 
plus eight 


13 


114 


AVAHACNPS 


0.120 


14 


96 


CCFHEIIKV 


0.110 


15 


112 


LGAVAHACN 


0.100 


16 


115 


VAHACNPST 


0.100 




17 


94 


PGCCFHEII 


0.100 


18 


109 


KFWLGAVAH 


0.100 


19 


85 


KLEVLESQD 


0.036 


20 


108 


RKFWLGAVA 


0.024 


21 


98 


FHEIIKVSY 


0.021 


22 


100 


EIIKVSYYR 


0.021 


23 


87 


EVLESQDPG 


0.018 


24 


118 


ACNPSTLGG 


0.018 


25 


113 


GAVAHACNP 


0.015 


26 


99 


HEIIKVSYY 


0.015 


27 


91 


SQDPGCCFH 


0.012 


28 


101 


IIKVSYYRK 


0.010 


29 


89 


LESQDPGCC 


0.010 


30 


95 


GCCFHEIIK 


0.010 


31 


117 


HACNPSTLG 


0.010 



139 
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TABLE XIII (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQr 

IP// 


32 


107 


YRKFWLGAV 


0.010 




33 


86 


LEVLESQDP 


0.002 


34 


92 


QDPGCCFHE 


0.002 



TABLE XIV (B) 

VARIANT 1A HKLEVLESQDPGCCFHE1IKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE Or HALr 1 1Mb Or DlbAboUClA 1 1UN 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


i 
1 


i aq 


Nvoi I KJs>r WL 


R nnn 


4 n WrPortio 
n of SEO 
ID NO: 5; 
each start 
position is 


z 


i a<; 


oil Kivr W LuA 


s nnn 


J 


i ha 


Y Y ivK.r W LuA V 


^ nnn 


A 

4 


1 1 c 

1 I J 


\/ A U A /"'XTDCTI ^ 

VAHALNro 1 L, 


4 nnn 


C 

J 


lUl 


llJvvb Y YKKr 


o onn 


specified. 


a 
O 


yj 


UruLLrncll 


i nnn 


the length 

of each 
peptide is 
10 amino 
acids, the 


/ 


1 AO 

Wy 


K.r WLuAVAHA 


i nnn 


o 
O 


y 1 


LrnbllKVoY 


n $Mn 


o 
y 


1 1 A 


ru/l r A\/AUAP 

r WLUAVAHAL 


n 7 1 n 


1 A 
10 




LboQLJrOCOr 


n onn 


end 
position for 


i i 
1 1 


yZ 


^Uruttr Hbl 


n i oft 




o / 


b V bboV^L/rUL, 


n i ftn 


each 

Deotide is 
the start 
position 

plus nine 


1 1 

13 


111 


A \/ A U A /^XTDC 


n i ftn 


1 A 

14 


oo 

oo 


V bbol^JUr Utt 


n i ^n 


i «j 




CCFHFTIKVS 
v^v^i riLiirv v o 


0.140 


16 


95 


GCCFHEI1KV 


0.110 




17 


114 


AVAHACNPST 


0.100 


18 


111 


WLGAVAHACN 


0.100 


19 


85 


KLEVLESQDP 


0.036 


20 


90 


ESQDPGCCFH 


0.018 


21 


100 


EIIKVSYYRK 


0.015 


22 


102 


IKVSYYRKFW 


0.015 


23 


98 


FHEIIKVSYY 


0.015 


24 


91 


SQDPGCCFHE 


0.012 


25 


104 


VSYYRKFWLG 


0.012 


26 


107 


YRKFWLGAV A 


0.012 


27 


112 


LGAVAHACNP 


0.010 


28 


117 


HACNPSTLGG 


0.010 


29 


84 


HKLEVLESQD 


0.002 


30 


99 


HEIIKVSYYR 


0.002 


31 


108 


RKFWLGAVAH 


0.002 


32 


86 


LEVLESQDPG 


0.002 


33 


94 


PGCCFHEIIK 


0.001 


34 


116 


AHACNPSTLG 


0.001 
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TABLE XV (B) 

VARIANT 1A KLEVLESQDPGCCFHEHKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 9-MERS 


RANK 


START 


SUBSEQUENCE 
rfsioiif i isTnsin 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOI FCI II F PONTAITsITTsJCi THIS SI IRSFOl JFNCF"i 


ccp 

ID// 


1 


93 


DPGCCFHEI 


8.000 


+044rPorti 
on of SEO 


2 


104 


VSYYRKFWL 


4.000 


3 


116 


AHACNPSTL 


1.200 


ID NO: 5: 

f»arh ^tart 


4 


115 


VAHACNPST 


0.300 


Dosition is 


5 


114 


AVAHACNPS 


0.300 


specified. 


6 


96 


CCFHEIIKV 


0.200 


the leneth 


7 


103 


KVSYYRKFW 


0.150 


of each 
peptide is 

9 amino 
acids, the 


8 


111 


WLGAVAHAC 


0.100 


9 


106 


YYRKFWLGA 


0.100 


10 


87 


EVLESQDPG 


0.050 


end 
Dosition 


i 11 


117 


HACNPSTLG 


0.045 


12 


94 


PGCCFHEII 


0.040 


for each 
Deotide is 


13 


113 


GAVAHACNP 


0.030 


14 


90 


ESQDPGCCF 


0.030 


the start 
Dosition 
plus eieht 


15 


118 


ACNPSTLGG 


0.030 


16 


88 


VLESQDPGC 


0.030 




17 


107 


YRKFWLGAV 


0.020 


18 


112 


LGAVAHACN 


0.020 


19 


89 


LESQDPGCC 


0.010 


20 


110 


FWLGAVAHA 


0.010 


21 


108 


RKFWLGAVA 


0.010 


! 22 


95 


GCCFHEIIK 


0.010 


23 


101 


IIKVSYYRK 


0.010 


24 


100 


EIIKVSYYR 


0.010 


25 


85 


KLEVLESQD 


0.003 


26 


91 


SQDPGCCFH 


0.003 


27 


97 


CFHEIIKVS 


0.002 


28 


102 


IKVSYYRKF 


0.002 


29 


100 


HEIIKVSYY 


0.002 


30 


109 


KFWLGAVAH 


0.001 


31 


86 


LEVLESQDP 


0.001 


32 


92 


QDPGCCFHE 


0.001 


33 


105 


SYYRKFWLG 


0.001 


34 


98 


FHEIIKVSY 


0.001 



TABLE XVI (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLG A VAH ACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121 PI Fl - B7, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQt 
ID// 


1 


103 


KVSYYRKFWL 


20.000 


-MW-Porti 
on of SEO 


2 


115 


VAHACNPSTL 


12.000 


3 


93 


DPGCCFHEII 


8.000 


ID NO: 5; 
each start 


4 


114 


AVAHACNPST 


1.500 


position is 



specified, 

141 the length 



of each 
peptide is 
10 amino 
acids, the 



end 
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TABLE XVI (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 10-MERS 


RANK 


START 

Pf^QlTTrVM 
rUol 1 IvJlN 


SUBSEQUENCE 
RPQinirP i iQTrwr; 

rvCkML/UC Llo 1 UNO 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCI ATION 

KJr r\ lVlV^L»HV^Ui-«C I /A 1 IN 1 IN VJ into JUDJDyUONv/CJ 


SEQ- 


5 


87 


EVLESQDPGC 


0.500 




6 


106 


YYRKFWLGAV 


0.200 


7 


95 


GCCFHEIIKV 


0.200 


8 


114 


GAVAHACNPS 


0.060 


9 


92 


QDPGCCFHEI 


0.040 


10 


117 


HACNPSTLGG 


0.030 


11 


88 


VLESQDPGCC 


0.030 


12 


96 


CCFHEIIKVS 


0.020 


13 


101 


IIKVSYYRKF 


0.020 


14 


111 


WLGAVAHACN 


0.020 


15 


no 


FWLGAVAHAC 


0.010 


16 


107 


YRKFWLGAVA 


0.010 


17 


105 


SYYRKFWLGA 


0.010 


18 


104 


VSYYRKFWLG 


0.010 


19 


109 


KFWLGAVAHA 


0.010 


20 


100 


EIIKVSYYRK 


0.010 


21 


90 


ESQDPGCCFH 


0.010 


22 


112 


LGAVAHACNP 


0.010 


23 


116 


AHACNPSTLG 


0.005 


24 


102 


IKVSYYRKFW 


0.003 


25 


89 


LESQDPGCCF 


0.003 


26 


91 


SQDPGCCFHE 


0.003 


27 


85 


KLEVLESQDP 


0.003 


28 


97 


CFHEIIKVSY 


0.002 


29 


108 


RKFWLGAVAH 


0.001 


30 


94 


PGCCFHEIIK 


0.001 


31 


86 


LEVLESQDPG 


0.001 


32 


99 


HEIIKVSYYR 


0.001 


33 


84 


HKLEVLESQD 


0.001 


34 


98 


FHEIIKVSYY 


0.001 



TABLE XVII (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIK VSYYRKFWLG A V AH ACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - B35, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. 
ID// 


1 


90 


ESQDPGCCF 


10.000 


4409rPorti 
on of SEO 


2 


93 


DPGCCFHEI 


8.000 


3 


104 


VSYYRKFWL 


5.000 


ID NO: 5; 
each start 


4 


103 


KVSYYRKFW 


1.000 


position is 


5 


115 


VAHACNPST 


0.300 


specified. 


6 


96 


CCFHEIIKV 


0.300 


the length 


7 


99 


HEIIKVSYY 


0.200 


of each 


8 


112 


LGAVAHACN 


0.100 

142 


peptide is 
9 amino 
acids, the 
end 



position 
for each 
peptide is 
the start 
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TABLE XVII (B) 

VARIANT 1A KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121 PI Fl -B35, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RbMDUh Lib 1 1NO 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 


CCA 


9 


111 


WLGAVAHAC 


0.100 




10 


114 


AVAHACNPS 


0.100 


11 


116 


AHACNPSTL 


0.100 


12 


102 


IKVSYYRKF 


0.100 


13 


107 


YRKFWLGAV 


0.060 


14 


98 


FHEIIKVSY 


0.060 


15 


94 


PGCCFHEII 


0.040 


16 


117 


HACNPSTLG 


0.030 


17 


106 


YYRKFWLGA 


0.030 


18 


101 


HKVSYYRK 


0.030 


19 


113 


GAVAHACNP 


0.030 


20 


88 


VLESQDPGC 


0.030 


21 


87 


EVLESQDPG 


0.020 


22 


97 


CFHEIIKVS 


0.020 


23 


108 


RKFWLGAVA 


0.020 


24 


89 


LESQDPGCC 


0.015 


25 


95 


GCCFHEIIK 


0.010 


26 


118 


ACNPSTLGG 


0.010 


27 


110 


FWLGAVAHA 


0.010 


28 


100 


EIIKVSYYR 


0.010 


29 


85 


KLEVLESQD 


0.006 


30 


91 


SQDPGCCFH 


0.003 


31 


109 


KFWLGAVAH 


0.002 


32 


86 


LEVLESQDP 


0.002 


33 


92 


QDPGCCFHE 


0.001 


34 


105 


SYYRKFWLG 


0.001 



TABLE XVIII (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAH ACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 -B35, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


|p// 


1 


93 


DPGCCFHEII 


8.000 


444^Porti 
on of SEO 


2 


115 


VAHACNPSTL 


3.000 


1 3 


101 


IIKVSYYRKF 


3.000 


ID NO: 5: 
each start 


4 


103 


KVSYYRKFWL 


2.000 


position is 


5 


97 


CFHEIIKVSY 


0.400 


specified, 


6 


113 


GAVAHACNPS 


0.300 


the leneth 


7 


95 


GCCFHEIIKV 


0.300 


of each 
oeDtide is 
10 amino 
acids, the 


8 


87 


EVLESQDPGC 


0.200 


9 


114 


AVAHACNPST 


0.100 


10 


89 


LESQDPGCCF 


0.100 


end 
position 


11 


90 


ESQDPGCCFH 


0.100 


12 


111 


WLGAVAHACN 


0.100 

143 


for each 
peptide is 
the start 
position 



plus nine 
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TABLE XVIII (B) 

VARIANT 1A HKLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG 
HLA PEPTIDE SCORING RESULTS - 121P1F1 - B35, 10-MERS 



RANK 


START 


SUBSEQUENCE 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 

OF A MDI FP1I1 F PniMTAITsirNin THI<s ^il IRSFOI IFNCF^ 


CCQ 

ID// 


13 


96 


CCFHEIIKVS 


0.100 




14 


106 


YYRKFWLGAV 


0.060 


15 


98 


FHEIIKVSYY 


0.060 


16 


102 


IKVSYYRKFW 


0.050 


17 


104 


VSYYRKFWLG 


0.050 


18 


88 


VLESQDPGCC 


0.045 


19 


92 


QDPGCCFHEI 


0.040 


20 


107 


YRKFWLGAVA 


0.030 


21 


117 


HACNPSTLGG 


0.030 


22 


109 


KFWLGAVAHA 


0.020 


23 


105 


SYYRKFWLGA 


0.010 


24 


110 


FWLGAVAHAC 


0.010 


25 


112 


LGAVAHACNP 


0.010 


26 


100 


EIIKVSYYRK 


0.010 


27 


85 


KLEVLESQDP 


0.009 


28 


91 


SQDPGCCFHE 


0.003 


29 


108 


RKFWLGAVAH 


0.002 


30 


84 


HKLEVLESQD 


0.002 


31 


86 


LEVLESQDPG 


0.001 


32 


116 


AHACNPSTLG 


0.001 


33 


94 


PGCCFHEIIK 


0.001 


34 


99 


HEIIKVSYYR 


0.001 



144 
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TABLE V (C): 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 121P1F1- Al, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ ID// 


1 


4 


KMELSEGSQ 


0.450 


Portion of 
SEO ID 


2 


5 


MELSEGSQK 


0.010 


3 


6 


ELSEGSQKH 


0.010 


NO: 7: each 

start 
position is 


4 


2 


KCKMELSEG 


0.001 


5 


3 


CKMELSEGS 


0.001 


specified. 


6 


1 


MKCKMELSE 


0.000 


the length 

oi each 
peptide is 9 

amino 
acids, the 

end 
position for 

each 
oeotide is 
the start 
position 

plus 
eight+W 
1178.7 



TABLE VI (C) 

VARIANT IB MKCKMELSEGSQKH A 

HLA PEPTIDE SCORING RESULTS - 12IP1FI - Al, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 




1 


4 


KMELSEGSQK 


9.000 


44SWortio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


6 


ELSEGSQKHA 


0.010 


3 


2 


KCKMELSEGS 


0.001 


4 


5 


MELSEGSQKH 


0.001 


5 


3 


CKMELSEGSQ 


0.001 


specified. 


6 


1 


MKCKMELSEG 


0.000 


the length 
of each 
peptide is 
10 amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus nine 



TABLE VII (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 12 1 PI Fl - A2, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. \DU 
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TABLE VII (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


6 


ELSEGSQKH 


0.023 


4439rPortio 
n of SEO 
ID NO: 7; 
each start 
Dosition is 


2 


5 


MELSEGSQK 


0.002 


3 


3 


CKMELSEGS 


0.001 


4 


4 


KMELSEGSQ 


0.000 


5 


2 


KCKMELSEG 


0.000 


specified. 


6 


1 


MKCKMELSE 


0.000 


the length 

of each 
peptide is 9 

amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus eight 



TABLE VIII (C) 

VARIANT IB MKCKMELSEGSQKH A 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


6 


ELSEGSQKHA 


1.528 


++95rPortio 
n of SEO 
ID NO: 7: 
each start 
position is 


2 


5 


MELSEGSQKH 


0.009 


3 


4 


KMELSEGSQK 


0.002 


4 


1 


MKCKMELSEG 


0.000 


5 


3 


CKMELSEGSQ 


0.000 


specified. 


6 


2 


KCKMELSEGS 


0.000 


the length 

of each 
peptide is 
10 amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus nine 



TABLE IX (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 1 2 1 P 1 F 1 - A3, 9-MERS 


RAN 
K 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


6 


ELSEGSQKH 


0.090 


1201-Portio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


5 


MELSEGSQK 


0.090 


3 


4 


KMELSEGSQ 


0.018 


4 


2 


KCKMELSEG 


0.001 



specified % 
the length 

of each 
peptide is 9 

amino 
acids, the 
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TABLE IX (C) 

VARIANT IB MKCKMELSEG SQKH 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 9-MERS 


RAN 
K 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


5 


3 


CKMELSEGS 


0.000 




6 


1 


MKCKMELSE 


0.000 



TABLE X (C) 

VARIANT IB MKCKMELSEGSQKHA 

HLA PEPTIDE SCORING RESULTS - 12 1P1F1 - A3, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


4 


KMELSEGSQK 


60.000 


+007rPortio 
n of SEO 
ID NO: 7: 
each start 
position is 
specified. 


2 


6 


ELSEGSQKHA 


0.045 


3 


2 


KCKMELSEGS 


0.001 


4 


5 


MELSEGSQKH 


0.001 


5 


1 


MKCKMELSEG 


0.000 


6 


3 


CKMELSEGSQ 


0.000 


the leneth 
of each 
peptide is 
10 amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus nine 



TABLE XI (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS -121P1F1-A11, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ. ID// 


1 


5 


MELSEGSQK 


0.090 


4244rPortio 
n of SEO 
ID NO: 7: 
each start 
position is 


2 


4 


KMELSEGSQ 


0.001 


3 


6 


ELSEGSQKH 


0.001 


4 


2 


KCKMELSEG 


0.001 


5 


3 


CKMELSEGS 


0.000 


specified. 


6 


1 


MKCKMELSE 


0.000 


the leneth 

of each 
peptide is 9 

amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus eieht 
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TABLE XII (C) 

VARIANT IB MKCKMELSEGSQKHA 

HLA PEPTIDE SCORING RESULTS -121P1F1-A11, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.ID// 


1 


4 


KMELSEGSQK 


1.200 


4^9rPortio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


5 


MELSEGSQKH 


0.001 


3 


2 


KCKMELSEGS 


0.001 


4 


6 


ELSEGSQKHA 


0.001 


5 


3 


CKMELSEGSQ 


0.000 


specified. 


6 


1 


MKCKMELSEG 


0.000 


the length 

of each 
peptide is 
10 amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus nine 



TABLE XIII (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 1 2 1 P 1 F 1 - A24, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.ID// 


1 


4 


KMELSEGSQ 


0.030 


1225.Portio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


2 


KCKMELSEG 


0.022 


3 


3 


CKMELSEGS 


0.022 


4 


6 


ELSEGSQKH 


0.016 


5 


5 


MELSEGSQK 


0.002 


specified. 


6 


1 


MKCKMELSE 


0.001 


the length 

of each 
peptide is 9 

amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
olus eight 



TABLE XIV (C) 

VARIANT IB MKCKMELSEGSQKHA 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.ID// 


i 1 


2 


KCKMELSEGS 


0.240 


+»WPortio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


6 


ELSEGSQKHA 


0.120 


3 


4 


KMELSEGSQK 


0.030 


4 


5 


MELSEGSQKH 


0.002 
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the length 

of each 
peptide is 
10 amino 
acids, the 
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TABLE XIV (C) 

VARIANT IB MKCKMELSEGSQKHA 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.1D// 


5 


3 


CKMELSEGSQ 


0.002 




6 


1 


MKCKMELSEG 


0.001 



TABLE XV (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDft 


1 


6 


ELSEGSQKH 


0.010 


1237.Portio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


| 2 


KCKMELSEG 


0.010 


3 


3 


CKMELSEGS 


0.006 


4 


4 


KMELSEGSQ 


0.003 


5 


5 


MELSEGSQK 


0.001 


specified. 


6 


1 


MKCKMELSE 


0.001 


the length 

of each 
peptide is 9 

amino 
acids, the 

end 
position for 

each 
oeDtide is 
the start 
position 
plus eight 



TABLE XVI (C) 

VARIANT IB MKCKMELSEGSQKHA 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDff 


1 


6 


ELSEGSQKHA 


0.100 


|1243.Portio 
n of SEO 
ID NO: 7: 
each start 
position is 


2 


2 


KCKMELSEGS 


0.020 


3 


3 


CKMELSEGSQ 


0.003 


4 


4 


KMELSEGSQK 


0.003 


5 


5 


MELSEGSQKH 


0.001 


SDecified, 


6 


1 


MKCKMELSEG 


0.001 


the length 

of each 
peptide is 
10 amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus nine 
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TABLE XVII (C) 

VARIANT IB MKCKMELSEGSQKH 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B35, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDff 


1 


2 


KCKMELSEG 


0.090 


4049rPortio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


6 


ELSEGSQKH 


0.020 


3 


3 


CKMELSEGS 


0.020 


4 


4 


KMELSEGSQ 


0.006 


5 


5 


MELSEGSQK 


0.002 


specified. 


6 


1 


MKCKMELSE 


0.001 


inc icn^iii 

of each 
peptide is 9 

amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus eight 



TABLE XVIII (C) 

VARIANT IB MKCKMELSEGSQKH A 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - B35, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDff 


1 


2 


KCKMELSEGS 


0.600 


43S^Portio 
n of SEO 
ID NO: 7; 
each start 
position is 


2 


6 


ELSEGSQKHA 


0.200 


3 


4 


KMELSEGSQK 


0.009 


4 


3 


CKMELSEGSQ 


0.002 


5 


1 


MKCKMELSEG 


0.002 


specified. 


6 


5 


MELSEGSQKH 


0.001 


the leneth 
of each 
peptide is 
10 amino 
acids, the 

end 
position for 

each 
peptide is 
the start 
position 
plus nine 
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TABLE V (D): 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1- Al, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfl 


1 


114 


RCETAKQIK 


18.000 


Portion 


Df 


2 


111 


KIGRCETAK 


0.020 


SEO IDT 


JO: 


3 


113 


GRCETAKQI 


0.001 


9; each s 


art 

is 


4 


112 


IGRCETAKQ 


0.001 


position 
specifie 


1 


5 


110 


AKIGRCETA 


0.001 


the lengtl 


of 


each pepi 


ide 


is 9 ami 
acids, tl 
end posit 
for eac 
peptide 
the star! 
position r 


10 

e 

on 
i 

is 
t 

lus 


eighth 
1 183.: 





TABLE VI (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - I21P1F1 - AI, IQ-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfl 


1 


113 


GRCETAKQIK 


0.010 


+36+rPor 
n of SEO 


io 
ID 


2 


110 


AK1GRCETAK 


0.010 


3 


111 


KIGRCETAKQ 


0.002 


NO: 9; e; 

start 
position 


ich 
is 


4 


109 


KAKIGRCETA 


0.001 


5 


112 


IGRCETAKQI 


0.000 


specifie 
the lenetl 


1, 
of 


each pepi 


ide 


is 10 am 
acids, tl 
end Dosit 
for eac 
peptide 
the star 
position t 


no 

e 

on 
i 

is 
t 

lus 


nine 





TABLE VII (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A2, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


111 


KIGRCETAK 


0.007 


4366rPor 
n of SEO 


io 
ID 


2 


113 


GRCETAKQI 


0.006 


3 


110 


AKIGRCETA 


0.003 


NO: 9; e; 

start 
position 
SDecifie 

the lenetl 


ch 

is 
i, 
of 


4 


112 


IGRCETAKQ 


0.000 

151 


each pepi 


ide 


is 9 amii 
acids, tl 
end posit 
for eac 


io 
e 

on 
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TABLE VII (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A2, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDf 


5 


114 


RCETAKQIK 


0.000 





TABLE VIII (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


112 


IGRCETAKQI 


0.009 


4474rPor| 
n of SEO 


io 
ID 


2 


111 


KIGRCETAKQ 


0.007 


3 


109 


KAKIGRCETA 


0.004 


NO: 9; e 
start 
position 


ich 
is 


4 


110 


AKIGRCETAK 


0.000 


5 


113 


GRCETAKQIK 


0.000 


specifie 
the lenetl 


J, 
of 


each pepl 


ide 


is 10 am 
acids, t\ 
end posit 
for eac 
peptide 
the stai 
position i 


no 

e 

on 
1 

is 
t 

lus 


nine 





TABLE IX (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfr 


1 


111 


KIGRCETAK 


6.000 


+376rPor 
n of SEO 


io 
ID 


2 


114 


RCETAKQIK 


L °- 200 


! 3 


113 


GRCETAKQI 


0.001 


NO: 9: e 
start 
position 


tch 
is 


4 


110 


AKIGRCETA 


0.000 


5 


112 


IGRCETAKQ 


0.000 


specifie 
the lengtl 


of 


each pepl 


ide 


is 9 ami 
acids, th 
end oosit 
for eac 
peptide 
the sial 
position i 


io 

ie 

on 

i 

is 
t 

lus 


eight 





TABLE X (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 
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TABLE X (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 10-MERS ! 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDff 


1 


113 


GRCETAKQIK 


0.090 


42S-kPor 
nofSEO 


io 
ID 


2 


110 


AKIGRCETAK 


0.045 


3 


111 


KIGRCETAKQ 


0.006 


NO: 9: e 
start 
position 


ich 
is 


4 


109 


KAKIGRCETA 


0.006 


5 


112 


IGRCETAKQI 


0.000 


specifie 
the lenetl 


j, 
of 


each pepl 


ide 


is 10 am: 
acids, tl 
end posit 
for eac 
peptide 
the stai 
position r 


no 
ie 
on 
i 

is 
t 

lus 


nine 





TABLE XI (D) 

VARIANT 2 AKIGRCETAKQIK 



9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


111 


KIGRCETAK 


1.200 


+086,Por 
n of SEO 


io 
ID 


2 


114 


RCETAKQIK 


0.600 


3 


110 


AKIGRCETA 


0.000 


NO: 9; e; 

start 
position 


ich 
is 


4 


113 


GRCETAKQI 


0.000 


5 


112 


IGRCETAKQ 


0.000 


soecifie 
the ienatf 


i, 
of 


each pepl 


ide 


is 9 ami 
acids, tl 
end posit 


io 

e 

on 


for eac 
peptide 
the star 
position t 


i 

is 
t 

lus 


eight 
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TABLE XII (D) 

VARIANT 2 KAKIGRCETAKQIK 

HL A PEPTIDE SCORING RESULTS -121P1F1-A11,! 0-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfl 


1 


113 


GRCETAKQIK 


0.060 


4WkPor 
n of SEO 


io 
ID 


i 2 


110 


AKIGRCETAK 


0.030 


3 


109 


KAKIGRCETA 


0.006 


NO: 9; e 
start 
position 


ich 
is 


! 4 


111 


KIGRCETAKQ 


0.001 


5 


112 


IGRCETAKQI 


0.000 


specifie 
the lenetl 


of 


each oeDl 


ide 


is 10 am 
acids, tl 
end oosit 
for eac 
peptide 
the stai 
position r 1 


no 
e 

on 
i 

is 
t 

lus 


nine 





TABLE XIII (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDff 


1 


113 


GRCETAKQI 


0.120 


4396,Por 
n of SEO 


io 
ID 


! 2 


114 


RCETAKQIK 


0.036 


3 


111 


KIGRCETAK 


0.020 


NO: 9; e; 

start 
position 


tch 
is 


4 


110 


AKIGRCETA 


0.015 


5 


112 


1GRCETAKQ 


0.011 


specifie 
the lenetl 


I, 
of 


each pent 


ide 


is 9 ami 
acids, tl 
end posit 
for eac 
oeDtide 
the stai 
position i 


10 

e 

on 
i 

is 
t 

lus 


eight 





TABLE XIV (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 12 IP 1F1 - A24, 1 0-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDf 


| 1 


112 


IGRCETAKQI 


1.000 


n of SEO 


io 
ID 


2 


109 


KAKIGRCETA 


0.200 


3 


111 


KIGRCETAKQ 


0.022 


NO: 9: ei 
start 
position 
specifie 

the lenetl 


ich 

is 
i, 
of 


4 


110 


AKIGRCETAK 


0.002 

154 


each Depi 


ide 


is 10 am 
acids, tl 
end oosit 


no 
e 

on 


for eac 
peptide 
the stai 
position t 


i 

is 
t 

lus 
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TABLE XIV (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121 P1F1 - A24, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


5 


113 


GRCETAKQIK 


0.001 






TABLE XV (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDf 


1 


112 


IGRCETAKQ 


0.100 


+306rPor 
n ofSEO 


io 
ID 


2 


113 


GRCETAKQI 


0.040 


3 


110 


AKIGRCETA 


0.030 


NO: 9; e: 
start 
position 


tch 
is 


4 


111 


KIGRCETAK 


0.010 


5 


114 


RCETAKQIK 


0.003 


specifie 
the lengtf 


J, 
of 


each oeol 


ide 


is 9 amii 
acids, t\ 
end Dosit 


io 

e 

on 


for eac 
peptide 
the stai 
oosition i 


i 

is 
t 

lus 


eieht 





TABLE XVI (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 -B7, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


112 


IGRCETAKQI 


4.000 


434-kPor 
n ofSEO 


io 
ID 


2 


109 


KAKIGRCETA 


0.300 


3 


111 


KIGRCETAKQ 


0.010 


NO: 9: e; 

start 
position 


ten 
is 


4 


110 


AKIGRCETAK 


0.003 


5 


113 


GRCETAKQIK 


0.001 


specifie 
the lenetl 


i, 
of 


each pepi 


ide 


is 10 am 
acids, tl 
end Dosit 
for eac 
peptide 
the stai 
position i 


no 

e 

ion 
i 

is 
t 

lus 


nine 













TABLE XVII (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 -B35, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDf 



155 



511582003440 



TABLE XVII (D) 

VARIANT 2 AKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - B35, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


113 


GRCETAKQI 


0.080 


4344rPor 
n of SEO 


io 

ID 


2 


112 


IGRCETAKQ 


0.045 


3 


111 


KIGRCETAK 


0.020 


NO: 9; ei 
start 
position 


ich 
is 


4 


110 


AKIGRCETA 


0.010 


5 


114 


RCETAKQIK 


0.006 


specifie 
the lenetl 


j, 
of 


each DeDi 


ide 


is 9 ami 
acids. tf 
end posit 
for eac 
Deotide 
the stai 
oosition t 


io 
e 

on 
l 

is 
t 

lus 


eight 





TABLE XVIII (D) 

VARIANT 2 KAKIGRCETAKQIK 

HLA PEPTIDE SCORING RESULTS - 121P1F1 -B35, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.ID^ 


1 


109 


KAKIGRCETA 


1.800 


n of SEO 


io 
ID 


2 


112 


IGRCETAKQI 


1.200 


3 


111 


KIGRCETAKQ 


0.030 


NO: 9: e 
start 
position 


ich 
is 


4 


113 


GRCETAKQIK 


0.002 


5 


110 


AK1GRCETAK 


0.001 


specifie 
the lenetr 


1, 
of 


each oeoi 


ide 


is 10 am 
acids, th 
end posit 
for eac 
peptide 
the stall 
position t 


no 

e 

on 
i 

is 
t 

lus 


nine 





156 



511582003440 



TABLE V (E): 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121P1F1- Al, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDj 


1 


151 


VVEEIHNIF 


9.000 


n of SEO 


io 
ID 


2 


154 


EIHNIFAIK 


0.400 


3 


152 


VEEIHNIFA 


0.225 


NO: It 
each sta 


rt 


4 


151 


QVVEEIHNI 


0.010 


position 


is 


5 


155 


IHNIFAIKS 


0.003 


specifie 




6 


156 


HNIFAIKSW 


0.003 


the length 


of 


7 j 


153 


EEIHNIFAI 


0.003 


each pepi 


ide 


8 


148 


DPQVVEEIH 


0.003 


is 9 ami 


10 


9 


149 


PQVVEEIHN 


0.001 


acids, tr 
end posit 
for eac 
peptide 
the stai 
position \ 


e 

on 
l 

is 
t 

lus 


eight? 





TABLE VI (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - Al, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


151 


VVEEIHNIFA 


4.500 


-W^Por 
n of SEO 


io 
ID 


1 2 


152 


VEEIHNIFAI 


0.225 


1 3 


150 


QVVEEIHNIF 


0.100 


NO: 11 
each sta 


rt 


i 4 


154 


EIHNIFAIKS 


0.050 


position 


is 


5 


153 


EEIHNIFAIK 


0.020 


specific 


j, 


6 


148 


DPQVVEEIHN 


0.013 


the lengtl 


of 


7 


156 


HNIFAIKSWA 


0.003 


each pepl 


ide 


8 


155 


IHNIFAIKSW 


0.001 


is 10 ami 


no 


9 


147 


CDPQVVEEIH 


0.001 


acids, tl 
end posit 


e 

on 


10 


149 


PQVVEEIHNI 


0.000 


for eac 
peptide 
the stai 
position t 


i 

is 
t 

lus 


nine 





TABLE VII (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


150 


QVVEEIHNI 


8.608 


++£7rPor 
n of SEO 


io 
ID 


2 


153 


EEIHNIFAI 


0.203 


3 


152 


VEEIHNIFA 


0.058 


NO: 11 
each sts 


rt 


4 


151 


VVEEIHNIF 


0.001 


position 


is 


5 


155 


IHNIFAIKS 


0.000 

157 


specifie 
the lengtl 
each pepl 

is 9 ami 

acids, tl 
end posit 
for eac 

peptide 


i, 
of 
ide 
io 

e 

on 
^ 

is 



511582003440 



TABLE VII (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 12 IP 1F1 - A2, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDft 


6 


149 


PQVVEEIHN 


0.000 




7 


154 


EIHNIFAIK 


0.000 




8 


156 


HNIFAIKSW 


0.000 




9 


148 


DPQVVEEIH 


0.000 





TABLE VIII (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A2, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDft 


1 


151 


VVEEIHNIFA 


1.067 


4496,Por 
n of SEO 


io 
ID 


2 


152 


VEEIHNIFA1 


0.294 


3 


149 


PQVVEEIHNI 


0.054 


NO: 11 
each sts 


rt 


4 


150 


QVVEEIHNIF 


0.011 


position 


is 


! 5 


156 


HNIFAIKSWA 


0.006 


specifie 


1, 


6 


154 


EIHNIFAIKS 


0.003 


the lengtl 


of 


7 


155 


IHNIFAIKSW 


0.000 


each pepi 


ide 


8 


148 


DPQVVEEIHN 


0.000 


is 10 am 
acids, tl 


no 


9 


147 


CDPQVVEEIH 


0.000 


end posit 


e 

on 


10 


153 


EEIHNIFAIK 


0.000 


for eac 
peptide 
the stai 
position r 


is 
t 

lus 


nine 





TABLE IX (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A3, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDf 


1 


154 


EIHNIFAIK 


2.700 


+306rPor 


io 


2 


151 


VVEEIHNIF 


0.450 


n of SEO 


ID 


3 


150 


QVVEEIHNI 


0.203 


NO: 1 1 
each sts 


rt 


4 


153 


EEIHNIFAI 


0.004 


position 


is 


5 


152 


VEEIHNIFA 


0.001 


specifie 


j, 


6 


148 


DPQVVEEIH 


0.001 


the lengtl 


of 


7 


156 


HNIFAIKSW 


0.000 


each pepi 


ide 


8 


155 


IHNIFAIKS 


0.000 


is 9 ami 
acids, tl 


io 


9 


149 


PQVVEEIHN 


0.000 


end posit 


e 

on 


for eac 
peptide 
the stai 
position i 


i 

is 
t 

Jus 


eight 





158 



511582003440 



TABLE X (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - A3, 10-MERS 


rv/\iN is. 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfl 


1 


150 


QVVEEIHNIF 


0.675 


+544rPor 
n of SEO 


io 
ID 


I 2 


153 


1 EEIHN1FAIK 


0.122 


1 3 


151 


VVEEIHNIFA 


0.060 


NO: li 
each sts 


rt 


4 




VFFIHNIFAI 

V UUtl Hill Al 


0.008 


position 


is 


5 


154 


EIHNIFAIKS 


0.007 


specific 




1 6 


149 


PQVVEEIHNI 


0.004 


the length 


of 


7 


156 


HNIFAIKSWA 


0.001 


each pepi 


ide 


8 


147 


CDPQVVEEIH 


0.000 


is 10 am 


no 


9 


155 


IHNIFAIKSW 


0.000 


acids, tr 
end posit 


e 

on 


10 


148 


DPQVVEEIHN 


0.000 


for eac 
peptide 
the stai 
Dosition i 


i 

is 
t 

lus 


nine 






TABLE XI (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121P1F1 -All, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfl 


1 


154 


EIHNIFAIK 


0.120 


+32SrPor 
n of SEO 


io 
ID 


2 


150 


QVVEEIHNI 


0.030 


3 


151 


VVEEIHNIF 


0.020 


NO: 1 1 
each sta 


rt 


4 


152 


VEEIHNIFA 


0.001 


position 


is 


5 


153 


EEIHNIFAI 


0.001 


spec i fie 


i, 


6 


148 


DPQVVEEIH 


0.001 


the 1 en £tl 


of 


7 


156 


HNIFAIKSW 


0.000 


each pern 


ide 


8 


149 


PQVVEEIHN 


0.000 


is 9 ami 
acids, tr 


io 


9 


155 


IHNIFAIKS 


0.000 


end posit 
for eac 
peptide 
the stai 

position t 


e 

on 
i 

is 
t 

lus 


eiaht 





TABLE XII (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 121P1F1-A11, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDff 


1 


151 


VVEEIHNIFA 


0.040 


+334,Por 
n of SEO 


io 
ID 


2 


150 


QVVEEIHNIF 


0.030 


3 


153 


EEIHNIFAIK 


0.027 


NO: 1 I 
each sU 


rt 


4 


152 


VEEIHNIFAI 


0.002 


position 


is 


5 


149 


PQVVEEIHNI 


0.001 


specifle 


1, 


6 


156 


HNIFAIKSWA 


0.001 


the lenetl 


of 


7 


154 


EIHNIFAIKS 


0.000 


each pepi 


ide 



159 



acids, tl e 
end posit on 
for each 
peptide 



is 



the stai t 
position r lus 



511582003440 



TABLE XII (E) 

VARIANT 3 CDPQVVEE1HNIFAIKSWA 
HLA PEPTIDE SCORING RESULTS - 12 1P1F1 - Al 1, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDf 


8 


147 


CDPQVVEEIH 


0.000 




9 


148 


DPQVVEEIHN 


0.000 


10 


155 


IHNIFAIKSW 


0.000 



TABLE XIII (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - A24, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDf 


1 


151 


VVEEIHNIF 


6.048 


4344rPor 
n of SEO 


io 
ID 


2 


150 


QVVEEIHNI 


1.800 


3 


156 


HNIFAIKSW 


0.150 


NO: 1 1 
each sts 


rt 


4 


153 


EEIHNIFAI 


0.150 


position 


is 


5 


148 


DPQVVEEIH 


0.021 


specifie 


i, 


6 


154 


EIHNIFAIK 


0.017 


the lenatl 


of 


7 


155 


IHNIFAIKS 


0.017 


each pepi 


ide 


8 


149 


PQVVEEIHN 


0.015 


is 9 ami 


io 


9 


152 


VEEIHNIFA 


0.015 


acids, tl 
end oosit 
for eac 
peptide 
the stai 
position \ 


e 

on 
i 

is 
t 

lus 


eight 





TABLE XIV (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 12 1P1F1 - A24, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDf 


1 


150 


QVVEEIHNIF 


6.048 


m^Pon 
n of SEO 


io 
ID 


2 


156 


HNIFAIKSWA 


0.210 


3 


151 


VVEEIHNIFA 


0.180 


NO: It 
each sta 


rt 


4 


148 


DPQVVEEIHN 


0.150 


position 


is 


5 


149 


PQVVEEIHNI 


0.150 


specific 


I 


6 


152 


VEEIHNIFAI 


0.150 


the len&tl 


lOf 


7 


154 


EIHNIFAIKS 


0.110 


each oeol 


ide 


8 


155 


IHNIFAIKSW 


0.015 


is 1 0 am 
acids, tl 


no 


9 


153 


EEIHNIFAIK 


0.003 


end posit 


e 

on 


10 


147 


CDPQVVEEIH 


0.002 


for eac 
peptide 
the stai 
position t 


l 

is 
t 

lus 


nine 





160 



511582003440 



TABLE XV (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 9-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfr 


1 


150 


QVVEEIHNI 


2.000 


+36^Por 
n of SEO 


io 
ID 


2 


148 


DPQVVEE1H 


0.200 


3 


153 


EEIHNIFAI 


0.040 


NO: I 
each stE 


rt 


4 


151 


VVEEIHNIF 


0.030 


position 


is 


5 


156 


HNIFAIKSW 


0.020 


specifie 


i, 


6 


154 


EIHNIFAIK 


0.010 


the length 


of 


7 


152 


VEEIHNIFA 


0.003 


each pepi 


ide 


8 


149 


PQVVEEIHN 


0.002 


is 9 ami 


io 


9 


155 


IHNIFAIKS 


0.002 


acids, tr 
end posit 
for eac 
peptide 
the stai 
position i 


e 

on 
l 

is 
t 

lus 


eight 





TABLE XVI (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B7, 10-MERS 



RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


148 


DPQVVEEIHN 


0.400 


4£73rPor 
n of SEO 


io 
ID 


2 


151 


VVEEIHNIFA 


0.150 


3 


150 


QVVEEIHNIF 


0.100 


NO: 11 
each sta 


rt 


4 


156 


HNIFAIKSWA j 


0.100 


position 


is 


5 


149 


PQVVEEIHNI 


0.040 


specifie 


j, 


6 


154 


EIHNIFAIKS 


0.020 


the lengtt 


of 


7 


152 


VEEIHNIFAI 


0.012 


each oepi 


ide 


8 


155 


IHNIFAIKSW 


0.002 


is 10 am 


no 


9 


153 


EEIHNIFAIK 


0.001 


acids, tl 
end posit 


e 

ion 


10 


147 


CDPQVVEEIH 


0.001 


for eac 
peptide 
the stai 
position \ 


l 

is 
t 

lus 


nine 





TABLE XVII (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121P1F1 - B35, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.lDfl 


1 


150 


QVVEEIHNI 


1.200 


m2rPor 
n of SEO 


io 
ID 


2 


151 


VVEEIHNIF 


0.600 


3 


156 


HNIFAIKSW 


0.500 


NO: 1 I 
each stE 


rt 


4 


148 


DPQVVEEIH 


0.200 


position 


is 


5 


153 


EEIHNIFAI 


0.040 


specifie 


J, 


6 


149 


PQVVEEIHN 


0.015 


the lengtl 


of 


7 


154 


EIHNIFAIK 


0.010 

161 


each pepi 


ide 


is 9 ami 
acids, tr 1 
end posit 
for eac 
peptide 
the staij 
position i 


10 

e 

on 
i 

is 
t 

lus 



511582003440 



TABLE XVII (E) 

VARIANT 3 DPQVVEEIHNIFAIKSW 

HLA PEPTIDE SCORING RESULTS - 121 PI Fl - B35, 9-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDff 


8 


155 


IHNIFAIKS 


0.010 




9 


152 


VEEIHNIFA 


0.003 





TABLE XVIII (E) 

VARIANT 3 CDPQVVEEIHNIFAIKSWA 

HLA PEPTIDE SCORING RESULTS - 121P1F1 -B35, 10-MERS 


RANK 


START 
POSITION 


SUBSEQUENCE 
RESIDUE LISTING 


SCORE (ESTIMATE OF HALF TIME OF DISASSOCIATION 
OF A MOLECULE CONTAINING THIS SUBSEQUENCE) 


SEQ.IDfl 


1 


148 


DPQVVEEIHN 


3.000 


44£+rPor 
nof SEO 


io 
ID 


2 


150 


QVVEE1HNIF 


2.000 


3 


154 


EIHNIFAIKS 


0.100 


NO: 1 1 

each sta 


rt 


4 


156 


HNIFAIKSWA 


0.100 


position 


is 


5 


149 


PQVVEEIHNI 


0.060 


specifie 


j, 


6 


151 


VVEEIHNIFA 


0.060 


the leneU 


of 


7 


155 


IHNIFAIKSW 


0.050 


each pepl 


ide 


8 


152 


VEEIHN1FAI 


0.012 


is 10 am 


no 


9 


153 


EEIHNIFAIK 


0.001 


acids, tl 
end posit 


e 

on 


10 


147 


CDPQVVEEIH 


0.001 


for eac 
peptide 
the stai 
Dosition i 


i 

is 
t 

lus 


nine 





162 
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Table XIX: Motifs and Post-translational Modifications of 121P1F1 

Protein kinase C phosphorylation site 
Number of matches: 4 

1 2-4 SICK 

2 46-48 SVK 

3 97-99 SQK 

4 129-131 SLR 

Casein kinase II phosphorylation site 
Number of matches: 4 

1 8-11 SAEE 

2 46-49 SVKE 

3 53-56 SLVD 

4 129-132 SLRD 

N-myristoylation site 

58-63 GMVDCE 



163 
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Table XX: Frequently Occurring Motifs 


Name 


avrg. °/o 
identity 


Description 


Potential Function 


zf-C2H2 


34% 


Zinc finger, C2H2 type 


Nucleic acid-binding protein functions as 
transcrintion factor nuclear location 
probable 


LYiocnrorne o in 


DO /0 


Cytochrome b(N- 

i*m i n 51 1 ^ /V** A /r\£*t R 
tCI 111 Hldl ^/ \J\Jf UCID 


membrane bound oxidase, generate 

QiinprnYiHp 

aUUCl UA1U.C 


is 


19% 


Immunoglobulin domain 


domains are one hundred amino acids long 
and include a conserved intradomain 
disulfide bond. 


WD40 


18% 


WD domain, G-beta repeat 


tandem reneats of* about 40 residues each 
containing a Trp-Asp motif. Function in 
signal transduction and protein interaction 


PD7 
r L/Zi 


Ld /o 


Pr>7 Hr\moin 
rLJ£^ uOlllairi 


may function in targeting signaling 

idaIppi i l*»c tr\ cnh-mi^mhrjinr^iiQ c i fpc 
1 1 IU ictu (Co v\J auu-iiicjiiui ttiiuua suca 


LRR 


28% 


Leucine Rich Repeat 


short sequence motifs involved in protein- 
protein interactions 


pKinase 


Z J /O 


Pr/~ifr<=»in UtidCP Hnmain 
r lO Iv 111 KlllabC UOIIlalll 


conserved cauuyuc* loic cuiiiiiiuii iu uum 
serine/threonine and tyrosine protein 
kinases containing an ATP binding site and 

51 r*sit"5i 1 \rtt C Clt"f* 
a caiaiyuc csiic 


r n 


1O/0 


pu Hnmnin 
i n uuiiiaiii 


pleckstrin homology involved in 
intracellular signaling or as constituents of 

the rvtoskeleton 


EGF 


34% 


EGF- like domain 


30-40 amino-acid long found in the 
extracellular domain of membrane-bound 
proteins or in secreted proteins 


rvt 


49% 


Reverse transcriptase 
(RNA-dependent DNA 
polymerase) 




ank 


25% 


Ank repeat 


Cytoplasmic protein, associates integral 
membrane proteins to the cytoskeleton 
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oxidored ql 


32% 


NADH- 

Ubiquinone/plastoquinone 
(complex I), various chains 


membrane associated. Involved in proton 
translocation across the membrane 


efhand 


24% 


EF hand 


calcium-binding domain, consists of al2 
residue loop flanked on both sides by a 12 
residue alpha-helical domain 


rvp 


79% 


Retroviral aspartyl protease 


Aspartyl or acid proteases, centered on a 
catalytic aspartyl residue 


Collagen 


42% 


Collagen triple helix repeat 
(20 copies) 


extracellular structural proteins involved in 
formation of connective tissue. The 
sequence consists of the G-X-Y and the 
polypeptide chains forms a triple helix. 


fh3 


20% 


Fibronectin type III domain 


Located in the extracellular ligand-binding 
region of receptors and is about 200 amino 
acid residues long with two pairs of 
cysteines involved in disulfide bonds 


7tm 1 


19% 


7 transmembrane receptor 
(rhodopsin family) 


seven hydrophobic transmembrane regions, 
with the N-terminus located extracellularly 
while the C-terminus is cytoplasmic. 
Signal through G proteins 
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TABLE XXI: Properties of 121P1F1 



121P1F1 



Bio in format ic 
Program 



URL located on the World Wide Web at Outcome 



ORF 

Protein length 
Transmembrane region 



Signal Peptide 
pi 

Molecular weight 
Localization 



Motifs 



ORF finder 

TM Pred 
HMMTop 
Sosui 
TMHMM 
Signal P 
pI/MW tool 
pI/MW tool 
PSORT 

PSORT II 
Pfam 

Prints 
Blocks 



http://www .ch.embnet.org/ 

http://www .enzim.hu/hmmtop/ 

http://www .genome.ad.jp/SOSui/ 

http://www .cbs.dtu.dk/services/TMHMM 

http://www .cbs.dtu.dk/services/SignalP/ 

http://www .expasy.ch/tools/ 

http://www .expasy.ch/tools/ 

http^/psort. n ibb. ac.j p/ 

http^/psort.nibb.ac.jp/ 
http://www .sanger.ac.uk/Pfam/ 

http://www .biochem.ucl.ac.uk/ 
http://www .blocks.fhcrc.org/ 



618 bp 
205 aa 
no TM 

no TM, intracellular 

no TM, soluble protein 

no TM 

none 

8.28 

23.7 kDa 

30% nuclear, 10% 
mitochondrial 

65% nuclear, 17% cytoplasmic 
Basic Zipper motif, Myc leucine 
zipper 

Steroid hormone receptor 
signature 

no significant motif 





Variant 1A 


Bioinformatic 
Program 


URL located on the World Wide Web at Outcome i 


ORF 


ORF finder 


618bp 



Protein length 
Transmembrane region 



Signal Peptide 
Pi 

Molecular weight 
Localization 



Motifs 



TM Pred http://www .ch.embnet.org/ 

HMMTop http://www .enzim.hu/hmmtop/ 

Sosui http://www .genome.ad.jp/SOSui/ 

TMHMM http://www .cbs.dtu.dk/services/TMHMM 

Signal P h ttp://w w w . cbs.dtu .dk/serv i ces/S ignal P/ 

pI/MW tool http://www .expasy.ch/tools/ 

pI/MW tool http://www .expasy.ch/tools/ 

PSORT http^psort.nibb.ac.jp/ 

PSORT II h#fK#psort.nibb.ac.jp/ 

Pfam http://www .sanger.ac.uk/Pfam/ 

Prints http://www .biochem.ucl.ac.uk/ 

Blocks http://www .blocks.fhcrc.org/ 



126 aa 

no TM | 

no TM, extracellular | 

no TM, soluble protein j 

no TM j 

none | 

8.65 j 

14.3 kDa j 
30% nuclear, 1 1% peroxisome | 

30% nuclear, 52.2% j 
cytoplasmic 

no significant motif | 

no significant motif | 

no significant motif | 





Variant 4 


Bioinformatic 
Program 


URL located on the World Wide Web at Outcome ! 


ORF 


ORF finder 


618 bp 



Protein length 
Transmembrane region 



Signal Peptide 
Pi 

Molecular weight 
Localization 



TM Pred http://www .ch.embnet.org/ 

HMMTop http://www .enzim.hu/hmmtop/ 

Sosui http://www .genome.adjp/SQSui/ 

TMHMM http://www .cbs.dtu.dk/services/TMHMM 

Signal P http://www .cbs.dtu.dk/services/SignalP/ 

pI/MW tool http://www .expasy.ch/tools/ 

pI/MW tool http://wwsv .expasy.ch/tools/ 

PSORT h«[*//psort.nibb.ac.jp/ 
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190 aa 
no TM 

no TM, intracellular 

no TM, soluble protein 

no TM 

none 

6.05 

22.02 kDa 
30% nuclear, 10% 
mitochondrial matrix space, 
10% lysosome 



511582003440 



PSORT II http^psort.nibb.ac.jp/ 65.2% nuclear, 21.7% | 

mitochondrial, 1 3% cytoplasmic 

Motifs Pfam http://vvwvv .sanger.ac.uk/Pfam/ bZip transcription factor j 

Myc leucine zipper 

Prints http://vvvvvv .biochem.ucl.ac.uk/ steroid hormone receptor j 

signature 

Blocks http://vvvvw .blocks.fhcrc.org/ no significant motif | 
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Table XXI IA. Nucleotide sequence of splice variant 1. (SEQ ID NO 41) 



ccaaaatcaa 
ggaagcccct 
actcgcatga 
aagattgctc 
ttagttgatg 
tttccaagta 
cctggctgct 
gcagtggctc 
tgactttccc 
atgcaagcct 
gaaccaggct 
aagtagaaaa 
aagtagccaa 
ccaaaagaaa 
actttgacta 
tgtaaatttt 
tttatcattt 
aaaaaaaa 



acgcgtccgg 
gcgcccgcgc 
tggaaatatt 
ccaaagagaa 
atggtatggt 
aagctcttca 
gcttccatga 
acgcctgtaa 
ccacccccac 
acagaaaagc 
agcaaaagag 
atacaaagac 
agaagctgct 
atttgggttt 
catagactaa 
aaactattat 
tattaatgtt 



gcctgtcccg 
catgtcaaag 
ttctgaaaca 
aggcattact 
tgactgtgag 
tgcaaggaaa 
aataattaaa 
tcccagcact 
atgaagtgca 
attgagaaag 
ctttcttcac 
tgtgatccgc 
aacagatgga 
gaagaaaata 
aatattccat 
ctaactaagt 
aaataaagtg 



cccctctccc 
aaaaaaggac 
aaagatgtat 
gctatgtcag 
aggatcggaa 
cataagttgg 
gtctcctatt 
ttgggaggct 
agatggagtt 
ctaaaattgg 
ttcgagacca 
aagttgtgga 
ctgataacat 
aaattgatag 
ggtggtgaag 
gtactgaatt 
taaaatgcaa 



i^ciciy^y^yyy 




60 


l y ct y ty Lay a 


q y ci a □ Qy u<^ u 


120 


t_ CL.aa (_ Lauu 


ay a l u ty y ay 


180 


Luaaciy aay i— 


c c f~ i~ c ^ ^ n c 


240 


L L too I i_ a 




300 


a y y t. t^tyyo 


a L» V — L- V — Q \-j CI \— • 


360 


atagaaaatt 


ctggctgggc 


420 


gaggcgggca 


gatcacgagg 


/on 
4 o U 


gtctgaggga 


agtcaaaagc 


540 


ccgatgtgaa 


acggaagagc 


600 


aagggaacag 


ctaaaggcag 


660 


agaaatacgc 


caagcaaata 


720 


attcgcaata 


aaatcttggg 


780 


aacttttgga 


attccagaag 


840 


gatgtacaag 


cttgtgaata 


900 


gtcgtttgcc 


tgtaactgtg 


960 


aaaaaaaaaa 


aaaaaaaaaa 


1020 
1028 



Table XXIIIA. Nucleotide sequence alignment of 121P1F1 (SEQ ID NO 42) with splice 
variant 1. (SEQ ID NO 43) . 

Score = 687 bits (357), Expect = 0 . Oldentities = 357/357 (100%) Strand = Plus / 
Plus 

121P1F1 : 1 ccaaaatcaaacgcgtccgggcctgtcccgcccctctccccaagcgcgggcccggccagc 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 1: 1 ccaaaatcaaacgcgtccgggcctgtcccgcccctctccccaagcgcgggcccggccagc 60 



121P1F1 



61 



Variant 1: 61 



ggaagcccctgcgcccgcgccatgtcaaagaaaaaaggactgagtgcagaagaaaagaga 120 

I I I I I if I I I I I I I I t I I I I I I I I I I I I I I I i I I I I I I I I I I I I i I I I I I I I I I I I I I I I 

ggaagcccctgcgcccgcgccatgtcaaagaaaaaaggactgagtgcagaagaaaagaga 120 



121P1F1 : 121 actcgcatgatggaaatattttctgaaacaaaagatgtatttcaattaaaagacttggag 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 1: 121 actcgcatgatggaaatattttctgaaacaaaagatgtatttcaattaaaagacttggag 180 

121P1F1 : 181 aagattgctcccaaagagaaaggcattactgctatgtcagtaaaagaagtccttcaaagc 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Variant 1: 181 aagattgctcccaaagagaaaggcattactgctatgtcagtaaaagaagtccttcaaagc 240 

121P1F1 : 241 ttagttgatgatggtatggttgactgtgagaggatcggaacttctaattattattgggct 300 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 1: 241 ttagttgatgatggtatggttgactgtgagaggatcggaacttctaattattattgggct 300 

121P1F1 : 301 tttccaagtaaagctcttcatgcaaggaaacataagttggaggttctggaatctcag 357 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 1: 301 tttccaagtaaagctcttcatgcaaggaaacataagttggaggttctggaatctcag 357 



Score - 985 bits (512), Expect = 0 . Oldentities = 512/512 (100%) Strand = Plus / 
Plus 

121P1F1 : 356 agttgtctgagggaagtcaaaagcatgcaagcctacagaaaagcattgagaaagctaaaa 415 

168 
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Variant 1: 517 



I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

agttgtctgagggaagtcaaaagcatgcaagcctacagaaaagcattgagaaagctaaaa 



576 



121P1F1 



416 



Variant 1: 577 



ttggccgatgtgaaacggaagagcgaaccaggctagcaaaagagctttcttcacttcgag 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
ttggccgatgtgaaacggaagagcgaaccaggctagcaaaagagctttcttcacttcgag 



475 



636 



10 



121P1F1 



476 



Variant 1: 637 



accaaagggaacagctaaaggcagaagtagaaaaatacaaagactgtgatccgcaagttg 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
accaaagggaacagctaaaggcagaagtagaaaaatacaaagactgtgatccgcaagttg 



535 



696 



15 121P1F1 : 536 tggaagaaatacgccaagcaaataaagtagccaaagaagctgctaacagatggactgata 595 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 1: 697 tggaagaaatacgccaagcaaataaagtagccaaagaagctgctaacagatggactgata 756 



20 



121P1F1 



596 



Variant 1: 757 



acatattcgcaataaaatcttgggccaaaagaaaatttgggtttgaagaaaataaaattg 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
acatattcgcaataaaatcttgggccaaaagaaaatttgggtttgaagaaaataaaattg 



655 



816 



25 



121P1F1 



656 



Variant 1; 817 



atagaacttttggaattccagaagactttgactacatagactaaaatattccatggtggt 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
atagaacttttggaattccagaagactttgactacatagactaaaatattccatggtggt 



715 



876 



30 



121P1F1 



716 



Variant 1: 877 



gaaggatgtacaagcttgtgaatatgtaaattttaaactattatctaactaagtgtactg 775 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
gaaggatgtacaagcttgtgaatatgtaaattttaaactattatctaactaagtgtactg 936 



35 



121P1F1 



776 



Variant 1: 937 



aattgtcgtttgcctgtaactgtgtttatcattttattaatgttaaataaagtgtaaaat 835 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

aattgtcgtttgcctgtaactgtgtttatcattttattaatgttaaataaagtgtaaaat 996 



40 121P1F1 : 836 gcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 867 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 1: 997 gcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 1028 



45 



50 



Table XXIVA. Amino acid sequence alignment of 121P1F1 (SEQ ID NO 44) and splice 
variant 1. (SEQ ID NO 45) . 



Score 
(100%) 



183 bits (465), Expect = 6e-4 7 Identities = 92/92 (100%), Positives = 92/92 



121P1F1 : 1 MSKKKGLSAEEKRTI^4MEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMV 60 

MSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDG^4V 
Variant 1A: 1 MSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMV 60 



55 121P1F1 : 61 DCERIGTSNYYWAFPSKALHARKHKLEVLESQ 92 

DCERIGTSNYYWAFPSKALHARKHKLEVLESQ 
Variant 1A: 61 DCERIGTSNYYWAFPSKALHARKHKLEVLESQ 92 



60 Score = 229 bits (584), Expect = le-60Identities = 113/114 
114/114 (99%) 



(99%) , Positives = 



169 
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121P1F1 : 92 QLSEGSQKHASLQKSIEKAKIGRCETEERTRLAKELSSLRDQREQLKAEVEKYKDCDPQV 151 

+LSEGSQKHASLQKSIEKAKIGRCETEERTRLAKELSSLRDQREQLKAEVEKYKDCDPQV 
Variant IB: 6 ELSEGSQKHASLQKSIEKAKIGRCETEERTRLAKELSSLRDQREQLKAEVEKYKDCDPQV 65 

121P1F1 : 152 VEEIRQANKVAKEAANRWTDNI FAIKSWAKRKFGFEENKIDRT FGIPEDFDYID 205 

VEEIRQANKVAKEAANRWTDNI FAIKSWAKRKFGFEENKIDRT FGIPEDFDYID 
Variant IB: 66 VEEIRQANKVAKEAANRWTDNI FAIKSWAKRKFGFEENKIDRT FGIPEDFDYID 119 



10 



15 



Table XXVA. Peptide sequences from the translation of the nucleotide sequence of 
splice variant 1 . 

>splice variant 1A ORF:82..462 Frame +1 (SEQ ID NO 46) . 

MSKKKGLSAE EKRTRMMEIF SETKDVFQLK DLEKIAPKEK GITAMSVKEV LQSLVDDGMV 60 
DCERIGTSNY YWAFPSKALH ARKHKLEVLE SQDPGCCFHE IIKVSYYRKF WLGAVAHACN 120 
PSTLGG 12 6 



20 



>splice variant IB ORF:501..860 Frame +3 (SEQ ID NO 47) . 

MKCKMELSEG SQKHASLQKS IEKAKIGRCE TEERTRLAKE LSSLRDQREQ LKAEVEKYKD 60 
CDPQVVEEIR QANKVAKEAA NRWTDNIFAI KSWAKRKFGF EENKIDRTFG IPEDFDYID 119 



Table XXI IB. Nucleotide sequence of splice variant 2. (SEQ ID NO 48) 



25 



30 



35 



ccaaaatcaa 
ggaagcccct 
actcgcatga 
aagattgctc 
ttagttgatg 
tttccaagta 
tctgagggaa 
cgatgtgaaa 
acatattcgc 
atagaacttt 
gaaggatgta 
aattgtcgtt 
gcaaaaaaaa 



acgcgtccgg 
gcgcccgcgc 
tggaaatatt 
ccaaagagaa 
atggtatggt 
aagctcttca 
gtcaaaagca 
cggccaagca 
aataaaatct 
tggaattcca 
caagcttgtg 
tgcctgtaac 
aaaaaaaaaa 



gcctgtcccg 
catgtcaaag 
ttctgaaaca 
aggcattact 
tgactgtgag 
tgcaaggaaa 
tgcaagccta 
aataaagtag 
tgggccaaaa 
gaagactttg 
aatatgtaaa 
tgtgtttatc 
aaaaaaaaaa 



cccctctccc 
aaaaaaggac 
aaagatgtat 
gctatgtcag 
aggatcggaa 
cataagttgg 
cagaaaagca 
ccaaagaagc 
gaaaatttgg 
actacataga 
ttttaaacta 
attttattaa 
aa 



caagcgcggg 


cccggccagc 


60 


tgagtgcaga 


agaaaagaga 


120 


ttcaattaaa 


agacttggag 


180 


taaaagaagt 


ccttcaaagc 


240 


cttctaatta 


ttattgggct 


300 


aggttctgga 


atctcagttg 


360 


ttgagaaagc 


taaaattggc 


420 


tgctaacaga 


tggactgata 


480 


gtttgaagaa 


aataaaattg 


540 


ctaaaatatt 


ccatggtggt 


600 


ttatctaact 


aagtgtactg 


660 


tgttaaataa 


agtgtaaaat 


720 
752 



Table XXIIIB. Nucleotide sequence alignment of 121P1F1 (SEQ ID NO 49) with splice 
40 variant 2. (SEQ ID NO 50) 

Score = 833 bits (433), Expect = 0 . Oldentities = 433/433 (100%) Strand = Plus / 
Plus 



45 121P1F1 



Variant 2: 1 



ccaaaatcaaacgcgtccgggcctgtcccgcccctctccccaagcgcgggcccggccagc 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ccaaaatcaaacgcgtccgggcctgtcccgcccctctccccaagcgcgggcccggccagc 60 



50 121P1F1 : 61 ggaagcccctgcgcccgcgccatgtcaaagaaaaaaggactgagtgcagaagaaaagaga 120 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 61 ggaagcccctgcgcccgcgccatgtcaaagaaaaaaggactgagtgcagaagaaaagaga 120 

55 121P1F1 : 121 actcgcatgatggaaatattttctgaaacaaaagatgtatttcaattaaaagacttggag 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 121 actcgcatgatggaaatattttctgaaacaaaagatgtatttcaattaaaagacttggag 180 



60 121P1F1 : 181 aagattgctcccaaagagaaaggcattactgctatgtcagtaaaagaagtccttcaaagc 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 



170 
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Variant 2: 181 aagattgctcccaaagagaaaggcattactgctatgtcagtaaaagaagtccttcaaagc 240 

121P1F1 : 241 ttagttgatgatggtatggttgactgtgagaggatcggaacttctaattattattgggct 300 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 241 ttagttgatgatggtatggttgactgtgagaggatcggaacttctaattattattgggct 300 

121P1F1 : 301 tttccaagtaaagctcttcatgcaaggaaacataagttggaggttctggaatctcagttg 360 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 301 tttccaagtaaagctcttcatgcaaggaaacataagttggaggttctggaatctcagttg 360 

121P1F1 : 361 tctgagggaagtcaaaagcatgcaagcctacagaaaagcattgagaaagctaaaattggc 420 

II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 361 tctgagggaagtcaaaagcatgcaagcctacagaaaagcattgagaaagctaaaattggc 420 



121P1F1 



421 



Variant 2: 421 



cgatgtgaaacgg 433 
I I I I I I I I I I I I I 
cgatgtgaaacgg 433 



Score = 615 bits (320), Expect = e-173Identities = 320/320 
Plus 121P1F1 = (SEQ ID NO 51), Variant 2 = (SEQ ID NO 52) 



(100%) Strand = Plus / 



121P1F1 : 548 gccaagcaaataaagtagccaaagaagctgctaacagatggactgataacatattcgcaa 607 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 433 gccaagcaaataaagtagccaaagaagctgctaacagatggactgataacatattcgcaa 492 



121P1F1 : 608 taaaatcttgggccaaaagaaaatttgggtttgaagaaaataaaattgatagaacttttg 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 4 93 taaaatcttgggccaaaagaaaatttgggtttgaagaaaataaaattgatagaacttttg 



667 



552 



121P1F1 : 668 gaattccagaagactttgactacatagactaaaatattccatggtggtgaaggatgtaca 727 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Variant 2: 553 gaattccagaagactttgactacatagactaaaatattccatggtggtgaaggatgtaca 612 

121P1F1 : 728 agcttgtgaatatgtaaattttaaactattatctaactaagtgtactgaattgtcgtttg 787 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 2: 613 agcttgtgaatatgtaaattttaaactattatctaactaagtgtactgaattgtcgtttg 672 

121P1F1 : 788 cctgtaactgtgtttatcattttattaatgttaaataaagtgtaaaatgcaaaaaaaaaa 847 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 2: 673 cctgtaactgtgtttatcattttattaatgttaaataaagtgtaaaatgcaaaaaaaaaa 732 



121P1F1 : 848 aaaaaaaaaaaaaaaaaaaa 867 

I I I I I I I I I I I I II I I I I i I 
Variant 2: 733 aaaaaaaaaaaaaaaaaaaa 752 



Table XXIVB. Amino acid sequence alignment of 121P1F1 (SEQ ID NO 53) and splice 
variant 2. (SEQ ID NO 54) 

Score = 232 bits (591), Expect - 2e-61Identities = 117/122 (95%), Positives = 
120/122 (97%) 



171 



511582003440 



121P1F1 : 1 
Variant 2: 1 
121P1F1 



MSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMV 60 
MSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMV 
MSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMV 60 



61 DCERIGTSNYYWAFPSKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEER 120 
DCERIGTSNYYWAFPSKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCET ++ 
Variant 2: 61 DCERIGTSNYYWAFPSKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETAKQ 120 

10 121P1F1 : 121 TR 122 

+ 

Variant 2: 121 IK 122 



15 Table XXVB. Peptide sequences from the translation of the nucleotide sequence of 
splice variant 2. (SEQ ID NO 55) 



MSKKKGLSAE EKRTRMMEIF SETKDVFQLK DLEKIAPKEK GITAMSVKEV LQSLVDDGMV 60 

DCERIGTSNY YWAFPSKALH ARKHKLEVLE SQLSEGSQKH ASLQKSIEKA KIGRCETAKQ 120 

20 IK 122 

Table XXIIC. Nucleotide sequence of splice variant 3. (SEQ ID NO 56) ♦ 



ccaaaatcaa acgcgtccgg gcctgtcccg cccctctccc caagcgcggg cccggccagc 60 

25 ggaagcccct gcgcccgcgc catgtcaaag aaaaaaggac tgagtgcaga agaaaagaga 120 

actcgcatga tggaaatatt ttctgaaaca aaagatgtat ttcaattaaa agacttggag 180 

aagattgctc ccaaagagaa aggcattact gctatgtcag taaaagaagt ccttcaaagc 240 

ttagttgatg atggtatggt tgactgtgag aggatcggaa cttctaatta ttattgggct 300 

tttccaagta aagctcttca tgcaaggaaa cataagttgg aggttctgga atctcagttg 360 

30 tctgagggaa gtcaaaagca tgcaagccta cagaaaagca ttgagaaagc taaaattggc 420 

cgatgtgaaa cggaagagcg aaccaggcta gcaaaagagc tttcttcact tcgagaccaa 480 

agggaacagc taaaggcaga agtagaaaaa tacaaagact gtgatccgca agttgtggaa 540 

gaaatacata acatattcgc aataaaatct tgggccaaaa gaaaatttgg gtttgaagaa 600 

aataaaattg atagaacttt tggaattcca gaagactttg actacataga ctaaaatatt 660 

35 ccatggtggt gaaggatgta caagcttgtg aatatgtaaa ttttaaacta ttatctaact 720 

aagtgtactg aattgtcgtt tgcctgtaac tgtgtttatc attttattaa tgttaaataa 780 

agtgtaaaat gcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 822 



40 



45 



50 



55 



60 



Table XXIIIC. Nucleotide sequence alignment of 121P1F1 (SEQ ID NO 57) with splice 
variant 3. (SEQ ID NO 58) . 

Score = 1052 bits (547), Expect = 0 . Oldentities = 547/547 (100%) Strand = Plus / 
Plus 

121P1F1 : 1 ccaaaatcaaacgcgtccgggcctgtcccgcccctctccccaagcgcgggcccggccagc 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Variant 3: 1 ccaaaatcaaacgcgtccgggcctgtcccgcccctctccccaagcgcgggcccggccagc 60 

121P1F1 : 61 ggaagcccctgcgcccgcgccatgtcaaagaaaaaaggactgagtgcagaagaaaagaga 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 3: 61 ggaagcccctgcgcccgcgccatgtcaaagaaaaaaggactgagtgcagaagaaaagaga 120 

121P1F1 : 121 actcgcatgatggaaatattttctgaaacaaaagatgtatttcaattaaaagacttggag 180 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 3: 121 actcgcatgatggaaatattttctgaaacaaaagatgtatttcaattaaaagacttggag 180 

121P1F1 : 181 aagattgctcccaaagagaaaggcattactgctatgtcagtaaaagaagtccttcaaagc 240 

172 



511582003440 



I II I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Variant 3: 181 aagattgctcccaaagagaaaggcattactgctatgtcagtaaaagaagtccttcaaagc 240 



121P1F1 : 241 ttagttgatgatggtatggttgactgtgagaggatcggaacttctaattattattgggct 
I II I I I I! I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I 1 I I I I I I I I I I 
Variant 3: 241 ttagttgatgatggtatggttgactgtgagaggatcggaacttctaattattattgggct 



300 



300 



10 121P1F1 : 301 tttccaagtaaagctcttcatgcaaggaaacataagttggaggttctggaatctcagttg 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 3: 301 tttccaagtaaagctcttcatgcaaggaaacataagttggaggttctggaatctcagttg 360 

15 '121P1F1 : 361 tctgagggaagtcaaaagcatgcaagcctacagaaaagcattgagaaagctaaaattggc 420 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 3: 361 tctgagggaagtcaaaagcatgcaagcctacagaaaagcattgagaaagctaaaattggc 4 20 

20 121P1F1 : 421 cgatgtgaaacggaagagcgaaccaggctagcaaaagagctttcttcacttcgagaccaa 480 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 3: 421 cgatgtgaaacggaagagcgaaccaggctagcaaaagagctttcttcacttcgagaccaa 480 

25 121P1F1 : 481 agggaacagctaaaggcagaagtagaaaaatacaaagactgtgatccgcaagttgtggaa 540 

I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 3: 481 agggaacagctaaaggcagaagtagaaaaatacaaagactgtgatccgcaagttgtggaa 540 



30 



121P1F1 



541 



Variant 3: 541 



gaaatac 547 
I I I I I I I 
gaaatac 547 



35 



40 



Score = 529 bits (275), Expect - e-147 Identities = 275/275 (100%) Strand = Plus / 
Plus 121P1F1 = (SEQ ID NO 59), Variant 3 = (SEQ ID NO 60). 



121P1F1 



593 



ataacatattcgcaataaaatcttgggccaaaagaaaatttgggtttgaagaaaataaaa 
I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 3: 548 ataacatattcgcaataaaatcttgggccaaaagaaaatttgggtttgaagaaaataaaa 



652 



607 



45 



121P1F1 : 653 ttgatagaacttttggaattccagaagactttgactacatagactaaaatattccatggt 712 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 3: 608 ttgatagaacttttggaattccagaagactttgactacatagactaaaatattccatggt 667 



121P1F1 : 713 ggtgaaggatgtacaagcttgtgaatatgtaaattttaaactattatctaactaagtgta 772 
I I I I I I I I I I 1 I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
50 Variant 3: 668 ggtgaaggatgtacaagcttgtgaatatgtaaattttaaactattatctaactaagtgta 727 

121P1F1 : 773 ctgaattgtcgtttgcctgtaactgtgtttatcattttattaatgttaaataaagtgtaa 832 
I I I I I I I I I I I I I I I I I I II I I I I I II I I I I II I I I I I I I I I M I I I I I I I I I I I I I I I I 
55 Variant 3: 728 ctgaattgtcgtttgcctgtaactgtgtttatcattttattaatgttaaataaagtgtaa 787 



60 



121P1F1 : 833 aatgcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 867 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 3: 788 aatgcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 822 



173 



511582003440 



Table XXIVC. Amino acid sequence alignment of 121P1F1 (SEQ ID NO 61) and splice 



va.ri3n t 


3. 


(SEQ ID NO 62) . 




Score = 


365 bits (937), Expect = e- 101 Identities = 189/205 (92%), Positives 




1 ft Q /9Ci ^ 

103/ ^UJ 


(92%) , 


Gaps = 15/205 (7%) 




121P1F1 




1 


M9K] ; <'K^^T.c;AFFKRTR^4MFTF c ?FTKnVFOT.K^)T J FKTAPKEKGTTAM55VKEVLOSLVDDG^4V 


60 








MQTfK'K^T ^AFFKRTRMMF T F ,( ?FTKnVFOT,KnT,FKTAPKFKnTTAM ( iVKF,VI.OSLVDDGMV 




\/a •»» i a r\ +* 
VQl ±dll L 


3: 


1 


MSKKKGLSAEEKRTRMMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMV 


60 


i oi pi rl 




61 


DCERIGTSNYYWAFPSKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEER 


120 








DCERIGTSNYYWAFPSKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEER 




Variant 


3: 


61 


DCERIGTSNYYWAFPSKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEER 


120 


121P1F1 




121 


TRLAKELSSLRDQREQLKAEVEKYKDCDPQWEEIRQANKVAKEAANRWTDNI FAIKSWA 


180 








TRLAKELSSLRDQREQLKAEVEKYKDCDPQWEEI NI FAIKSWA 




Variant 


3: 


121 


TRLAKELSSLRDQREQLKAEVEKYKDCDPQWEEIH NI FAIKSWA 


165 


121P1F1 




181 


KRKFGFEENKIDRTFGIPEDFDYID 205 










KRKFGFEENKIDRTFGIPEDFDYID 




Variant 


3: 


166 


KRKFGFEENKIDRTFGIPEDFDYID 190 





25 Table XXVC. Peptide sequences from the translation of" the nucleotide sequence of 
splice variant 3. (SEQ ID NO 63) . 

MSKKKGLSAE EKRTRMMEIF SETKDVFQLK DLEKIAPKEK GITAMSVKEV LQSLVDDGMV 60 
DCERIGTSNY YWAFPSKALH ARKHKLEVLE SQLSEGSQKH ASLQKSIEKA KIGRCETEER 120 
30 TRLAKELSSL RDQREQLKAE VEKYKDCDPQ WEE I HN I FA IKSWAKRKFG FEENKIDRTF 180 
GIPEDFDYID 190 



35 



40 



45 



50 



55 



Table XXIID. Nucleotide sequence of splice variant 4, (SEQ ID NO 64) 



gttttctgta 
tggatcttca 
gggcaggagt 
ttaaggacgc 
aaagaaaaaa 
aacaaaagat 
tactgctatg 
tgagaggatc 
gaaacataag 
cctacagaaa 
gctagcaaaa 
aaaatacaaa 
caaagaagct 
aaaatttggg 
ctacatagac 
tttaaactat 
ttttattaat 
aaagcaggat 
ccatctcagt 
tccaagctta 
aagtt 



ttgtaatatg 
ctgtgtgcca 
cgctgctctt 
tctgcactga 
ggactgagtg 
gtatttcaat 
tcagtaaaag 
ggaacttcta 
ttggaggttc 
agcattgaga 
gagctttctt 
gactgtgatc 
gctaacagat 
tttgaagaaa 
taaaatattc 
tatctaacta 
gttaaataaa 
gataaccata 
caaagtcgag 
cgtacgcgtg 



tagagcacat 
attagtcgat 
gtgccgggtg 
attaggcttc 
cagaagaaaa 
taaaagactt 
aagtccttca 
attattattg 
tggaatctca 
aagctaaaat 
cacttcgaga 
cgcaagttgt 
ggactgataa 
ataaaattga 
catggtggtg 
agtgtactga 
gtgtaaaatg 
tccccccagt 
cggccgcgaa 
catgcgacgt 



tccagaactg 
ttctgtgaaa 
ctgctggttg 
ctcgtgggtc 
gagaactcgc 
ggagaagatt 
aagcttagtt 
ggcttttcca 
gttgtctgag 
tggccgatgt 
ccaaagggaa 
ggaagaaata 
catattcgca 
tagaactttt 
aaggatgtac 
attgtcgttt 
cagatgttct 
gctcatcaaa 
tttagtagta 
catagctctt 



ctcagtttcg 


agttacctaa 


60 


acgccccggt 


ttctgccaaa 


120 


tgtagggcgc 


tgttgctttt 


180 


atgatcagtt 


aagtcctgtc 


240 


atgatggaaa 


tattttctga 


300 


gctcccaaag 


agaaaggcat 


360 


gatgatggta 


tggttgactg 


420 


agtaaagctc 


ttcatgcaag 


480 


ggaagtcaaa 


agcatgcaag 


540 


gaaacggaag 


agcgaaccag 


600 


cagctaaagg 


cagaagtaga 


660 


cgccaagcaa 


ataaagtagc 


720 


ataaaatctt 


gggccaaaag 


780 


ggaattccag 


aagactttga 


840 


aagcttgtga 


atatgtaaat 


900 


gcctgtaact 


gtgtttatca 


960 


tcaccccttt 


tggtagaaca 


1020 


gtaggacact 


aaaaatccat 


1080 


gtagcggccg 


ctctagagga 


1140 


ctatagtgtc 


acctaaattc 


1200 
1205 



Table XXIIID. Nucleotide sequence alignment of 121P1F1 (SEQ ID NO 65) with splice 
60 variant 4. (SEQ ID NO 66) . 



174 



511582003440 



Score = 1454 bits (756), Expect = 0 . Oldentities = 756/756 (100%) 
Plus 



Strand = Plus / 



121P1F1 



Variant 4 : 



83 tgtcaaagaaaaaaggactgagtgcagaagaaaagagaactcgcatgatggaaatatttt 142 

I I I I I I I I I I I I I I I I I I I I I I I I t I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
237 tgtcaaagaaaaaaggactgagtgcagaagaaaagagaactcgcatgatggaaatatttt 296 



121P1F1 : 143 ctgaaacaaaagatgtatttcaattaaaagacttggagaagattgctcccaaagagaaag 
10 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 4: 297 ctgaaacaaaagatgtatttcaattaaaagacttggagaagattgctcccaaagagaaag 



202 



356 



121P1F1 : 203 gcattactgctatgtcagtaaaagaagtccttcaaagcttagttgatgatggtatggttg 262 

15 I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I II I I I I I II I I II II I I I I 

Variant 4: 357 gcattactgctatgtcagtaaaagaagtccttcaaagcttagttgatgatggtatggttg 416 

121P1F1 : 263 actgtgagaggatcggaacttctaattattattgggcttttccaagtaaagctcttcatg 322 

20 I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Variant 4: 417 actgtgagaggatcggaacttctaattattattgggcttttccaagtaaagctcttcatg 476 

121P1F1 : 323 caaggaaacataagttggaggttctggaatctcagttgtctgagggaagtcaaaagcatg 382 

25 I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 4: 477 caaggaaacataagttggaggttctggaatctcagttgtctgagggaagtcaaaagcatg 536 

121P1F1 : 383 caagcctacagaaaagcattgagaaagctaaaattggccgatgtgaaacggaagagcgaa 442 

30 I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I 

Variant 4: 537 caagcctacagaaaagcattgagaaagctaaaattggccgatgtgaaacggaagagcgaa 596 

121P1F1 : 443 ccaggctagcaaaagagctttcttcacttcgagaccaaagggaacagctaaaggcagaag 502 

35 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 4: 597 ccaggctagcaaaagagctttcttcacttcgagaccaaagggaacagctaaaggcagaag 656 



121P1F1 : 503 tagaaaaatacaaagactgtgatccgcaagttgtggaagaaatacgccaagcaaataaag 
40 I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I 

Variant 4: 657 tagaaaaatacaaagactgtgatccgcaagttgtggaagaaatacgccaagcaaataaag 



562 



716 



121P1F1 : 563 tagccaaagaagctgctaacagatggactgataacatattcgcaataaaatcttgggcca 622 

45 I I || I I I I I I I I II II I I I II I I I I I I I I I I I II I I II I I I II I I I I I I I I I I I I M I I I 

Variant 4: 717 tagccaaagaagctgctaacagatggactgataacatattcgcaataaaatcttgggcca 776 

121P1F1 : 623 aaagaaaatttgggtttgaagaaaataaaattgatagaacttttggaattccagaagact 682 

50 I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

Variant 4: 777 aaagaaaatttgggtttgaagaaaataaaattgatagaacttttggaattccagaagact 836 

121P1F1 : 683 ttgactacatagactaaaatattccatggtggtgaaggatgtacaagcttgtgaatatgt 742 

55 I I I I II I I I I I I I II I I II I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I II I I I I I I 

Variant 4: 837 ttgactacatagactaaaatattccatggtggtgaaggatgtacaagcttgtgaatatgt 896 



121P1F1 : 743 aaattttaaactattatctaactaagtgtactgaattgtcgtttgcctgtaactgtgttt 802 
60 I I | I I I I I || I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I M I 

Variant 4: 897 aaattttaaactattatctaactaagtgtactgaattgtcgtttgcctgtaactgtgttt 956 



175 



511582003440 



121P1F1 : 803 atcattttattaatgttaaataaagtgtaaaatgca 838 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Variant 4: 957 atcattttattaatgttaaataaagtgtaaaatgca 992 



Table XXIVD. Amino acid sequence alignment of 121P1F1 (SEQ ID NO 67) and splice 
variant 4. (SEQ ID NO 68) . 



10 



15 



20 



25 



30 



35 



Score = 380 bits (975), Expect = e-105Identities 
190/190 (100%) 



190/190 (100%), Positives = 



121P1F1 
Variant 4 
121P1F1 
Variant 4 
121P1F1 
Variant 4 
121P1F1 
Variant 4 



16 MMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMVDCERIGTSNYYWAFP 7 5 

MMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMVDCERIGTSNYYWAFP 
1 MMEIFSETKDVFQLKDLEKIAPKEKGITAMSVKEVLQSLVDDGMVDCERIGTSNYYWAFP 60 

7 6 SKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEERTRLAKELSSLRDQRE 135 

SKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEERTRLAKELSSLRDQRE 
61 SKALHARKHKLEVLESQLSEGSQKHASLQKSIEKAKIGRCETEERTRLAKELSSLRDQRE 120 

136 QLKAEVEKYKDCDPQVVEEIRQANKVAKEAANRWTDNIFAIKSWAKRKFGFEENKIDRTF 195 

QLKAEVEKYKDCDPQVVEE I RQANKVAKE AANRWTDN I FAI KSWAKRKFG FEENKI DRT F 
121 QLKAEVEKYKDCDPQVVEE I RQANKVAKE AANRWTDN I FAI KSWAKRKFG FEENKI DRT F 180 

196 GIPEDFDYID 205 

GIPEDFDYID 
181 GIPEDFDYID 190 



Table XXVD . Peptide sequences from the translation of the nucleotide sequence of 
splice variant 4. (SEQ ID NO 69) . 

MMEIFSETKD VFQLKDLEKI APKEKGITAM SVKEVLQSLV DDGMVDCERI GTSNYYWAFP 60 
SKALHARKHK LEVLESQLSE GSQKHASLQK SIEKAKIGRC ETEERTRLAK ELSSLRDQRE 120 
QLKAEVEKYK DCDPQVVEEI RQANKVAKEA ANRWTDNI FA IKSWAKRKFG FEENKIDRTF 180 
GIPEDFDYID 190 



176 



511582003440 



Table XXVI 

MHC Class 1 nonamer and decamer analysis of 121 PI Fl for selected alleles. Listed are scores that fall within the 
top 50% (rounded up) of all scores for the selected allele. 

SEQ ID NO: 



HLA-A*0201 nonamers 




Pos 


123456789 


score 


122 


RLAKELSSL 


28 


78 


ALH ARKHKL 


25 


42 


ITAMS VKEV 


23 


46 


SVKEVLQSL 


23 


129 


SLRDQREQL 


23 


34 


KI APKEKGI 


22 


102 


SLQKSIEK A 


22 


85 


KLE VLESQL 


21 


196 


GIPEDFDYI 


19 


15 


RMMEIFSET 


17 


18 


EIFSETKDV 


17 


27 


FQLKDLEKI 


17 


80 


HARKHKLEV 


17 


165 


A ANRWTDNI 


17 


50 


VLQSL VDDG 


16 


81 


ARKHKLEVL 


16 


88 


VLESQLSEG 


16 


92 


QLSEGSQKH 


16 


21 


SETKD V FQL 


15 


43 


TAMSVKEVL 


15 


136 


QLKAEVEKY 


15 


6 


GLS AEEKRT 


14 


28 


QLKDLEKI A 


14 


71 


YWAFPSKA L 


14 


133 


QREQLKAE V 


14 


147 


CDPQ V VEEI 


14 


150 


QVVEEIRQA 


14 


189 


N KI DRIFG I 


14 




HLA-A1 nonamers 




Pos 


123456789 


score 


195 


FGIPEDFD Y 


2 


136 


QLKAEVEKY 


19 


169 


WTDNIFAIK 


19 


23 


TKDVFQLKD 


18 


116 


ETEE RTRL A 


18 



ID NO: 3: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids. 

the end 
position for 
each peptide is 

the start 
position plus 
eight 



SEQ IP NO: 



Portion of SEP ID 
NO: 3; each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the staif 77 
position plus eight 



511582003440 



HLA-Alnonamers SEQ1PNO: 



Pos 


123456789 


score 


62 


CERIGTSN Y 


17 


117 


TEERTRLAK 


17 


124 


AKELSSLRD 


17 


146 


DCDPQVVEE 


17 


63 


ERIGTSN Y Y 


16 


106 


SIEKAKIGR 


16 


20 


FSETKD VFQ 


15 


59 


MVDCERIGT 


15 


93 


LS EG SQKH A 


15 


29 


LKDLEKI AP 


14 


88 


VLESQLSEG 


14 


185 


GFEENKJDR 


14 


8 


S AEEKRTRM 


13 


22 


ETKDVFQLK 


13 


31 


DLEKI APKE 


13 


47 


VKE VLQSL V 


13 


55 


VDDGMVDCE 


13 


144 


YKDCDPQVV 


13 


190 


KIDRTFG I P 


13 


9 


AEEKRTRMM 


12 


37 


PKEKG1TAM 


12 


54 


LVDDGMVDC 


12 


130 


LRDQREQLK 


12 


138 


KAE VEKYKD 


12 


151 


V VEEIRQAN 


12 


162 


AKEAANRWT 


12 


1 


MSKKKG LS A 


11 


45 


MS VKE VLQS 


11 


61 


DCERIGTSN 


11 


85 


KLE V L ESQ L 


11 


140 


E VEK YKDCD 


11 


152 


VE EI RQANK 


11 


186 


FEENKI DRT 


11 


13 


RTRMMEIFS 


10 


16 


MMEIFSETK 


10 


114 


RCETEERTR 


10 


133 


QREQLK A E V 


10 


197 


IPEDFDYID 


10 



HLA-A26 nonamers 



SEQ ID NO: 

178 



511582003440 



Pos 


123456789 


score 


46 


SVKEVLQSL 


2' 


66 


GTSN Y Y W AF 


25 


122 


RLAKELSSL 


24 


136 


QLKAEVEKY 


24 


193 


RTFGIPEDF 


24 


22 


ETKDVFQLK 


23 


49 


EVLQSLVDD 


23 


25 


DVFQLKDLE 


20 


63 


ERIGTSNYY 


20 


87 


EVLESQLSE 


20 


18 


EIFSETKDV 


19 


85 


KLEVLESQL 


19 


129 


SLRDQREQL 


19 


19 


IFSETKDVF 


18 


95 


EGSQKHASL 


18 


116 


ETEERTRL A 


18 


31 


DLEKI APKE 


17 


42 


ITAMS VKEV 


17 


54 


LVDDGM VDC 


17 


78 


ALHARKHKL 


17 


126 


ELSSLRDQR 


17 


140 


EVEKYKDCD 


17 


150 


Q V VEEI RQA 


17 


154 


EI RQANKVA 


17 


187 


EENKI DRTF 


17 


196 


GIPEDFDYI 


17 


88 


VLESQLSEG 


16 


119 


ERTRLAKEL 


16 


146 


DCDPQVVEE 


16 


169 


WTDNIFAI K 


16 


34 


KI APKEKGl 


15 


102 


SLQKSIEKA 


15 


190 


KI DRTFG I P 


15 


12 


KRTRMME1F 


14 


21 


SETKD VFQL 


14 


37 


PKEKG ITAM 


14 


50 


VLQSLVDDG 


14 


81 


ARKHKLEVL 


14 


132 


DQREQLK AE 


14 


151 


V VEE1RQAN 


14 



Portion of SEP ID 
NO: 3: each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



179 



511582003440 





HLA-A26 nonamers 




Pos 


123456789 


score 


160 


KVAKEAANR 


14 


195 


FGIPEDFDY 


14 


24 


KDVFQLKDL 


13 


171 


DNIFAIKSW 


13 


172 


NIFAIKSWA 


13 


175 


AIKSWAKRK 


13 


178 


SWAKRKFGF 


13 




HLA-A3 nonamers 




Pos 


123456789 


score 


175 


AIKSWAKRK 


2i 


160 


KVAKEAANR 


24 


40 


KGIT AM S V K 


23 


91 


SQLSEGSQK 


22 


136 


QLKAEVEKY 


21 


30 


KDLEKIAPK 


20 


53 


SLVDDGMVD 


20 


122 


RL AKELSSL 


20 


85 


K LE V L ESQ L 


19 


92 


QLSEGSQKH 


19 


129 


S LRDQREQ L 


19 


155 


IRQANKVAK 


19 


87 


EVLESQLSE 


18 


97 


SQKH ASLQK 


18 


117 


TEERTRL AK 


18 


126 


ELSSLRDQR 


18 


4 


KKGLSAEEK 


17 


54 


LVDDGMVDC 


17 


78 


ALH ARKHKL 


17 


34 


K I AP KE KG I 


16 


46 


SVKEVLQSL 


16 


49 


EVLQSLVDD 


16 


69 


NYYWAFPSK 


16 


75 


PSK ALH ARK 


16 


77 


KALHARKHK 


16 


101 


ASLQKS I E K 


16 


135 


EQLKAEVEK 


16 


150 


QVVEEIRQA 


16 


152 


VEEIRQANK 


16 


173 


I FA I KSWAK 


16 



SEQ ID NO: 



SEQ ID NO: 



Portion of SEP ID 
NO: 3: each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



180 



511582003440 





HLA-A3 nonamers 




Pos 


123456789 


score 


182 


RKFGFEENK 


16 


16 


MMEIFSETK 


15 


26 


VFQLKDLEK 


15 


62 


CERIGTSN Y 


15 


111 


KIGRCETEE 


15 


154 


EIRQANKVA 


15 


190 


K I DRTFG I P 


15 


28 


QLKDL EKI A 


14 


41 


GITAMSVKE 


14 


110 


AKIG RCETE 


14 


169 


WTDNI FAIR 


14 


172 


N I FA IKS W A 


14 


22 


ETKDVFQLK 


13 


31 


DLEKI APKE 


13 


32 


LEKIAPKEK 


13 


36 


APKEKGITA 


13 


88 


VLESQLSEG 


13 


106 


SIEKAKIGR 


13 


134 


REQLKAEVE 


13 


137 


LKAEVEKYK 


13 


151 


VVEEIRQAN 


13 


6 


GLSAEEKRT 


12 


64 


RIGTSNYYW 


12 


103 


LQKSIEKA K 


12 


114 


RCETEERTR 


12 


130 


LRDQREQLK 


12 


145 


KDCDPQ V VE 


12 


195 


FGIPEDFD Y 


12 




HLA-B*0702 nonamers 




Pos 


123456789 


score 


36 


APKEKGITA 


1< 


71 


YWAFPSKAL 


15 


74 


FPSKALHAR 


14 


95 


EGSQKH ASL 


14 


78 


ALH ARKHKL 


13 


81 


ARKH KLE V L 


13 


122 


RLAKELSSL 


13 


129 


SLRDQREQL 


13 


21 


SETKDVFQL 


12 



SEQ ID NO: 



SEQ1DNO: 



NO: 3; each start 
position is 
specified. the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



181 



511582003440 





HLA-B*0702 nonamers 




Pos 


123456789 


score 


43 


TAMSVKEVL 


12 


115 


CETEERTRL 


12 


24 


KDVFQLKDL 


11 


80 


HARKHKLEV 


11 


85 


KLEVLESQL 


11 


119 


ERTRL AKEL 


11 


197 


1PEDFDYID 


11 


1 


MSKKKGLSA 


10 


9 


AEEKRTRMM 


10 


19 


1FSETKD VF 


10 


46 


SVKE VLQSL 


10 


73 


AFPSK ALH A 


10 


148 


DPQ VVEEIR 


10 


154 


EIRQANKVA 


10 


166 


ANR WTDNIF 


10 


6 


GLS AEEKRT 


9 


11 


EKRTRMMEI 


9 


15 


RMMEIFSET 


9 


34 


KI APKEKGI 


9 


37 


PKEKGITAM 


9 


42 


IT AMSVKEV 


9 


66 


GTSNYYWAF 


9 


104 


QKSIEKAKI 


9 


131 


RDQREQLKA 


9 


158 


ANKVAKEAA 


9 


162 


AKEA ANR WT 


9 


165 


A ANR WTDNI 


9 


176 


IKSWAKRKF 


9 


193 


RTFGIPEDF 


9 




HLA-B*08 nonamers 




Pos 


123456789 


score 


81 


ARKHKLEVL 


3( 


36 


APKEKG IT A 


28 


46 


SVKEVLQSL 


24 


78 


ALHARKHKL 


24 


129 


SLRDQREQL 


24 


179 


WAKRKFGFE 


24 


11 


EKRTRMMEI 


23 


95 


EGSQKH ASL 


22 



SEQ ID NO: 



SEQ ID NO: 



NO: 3; each start 
position is 
specified. the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



182 



511582003440 





HLA-B*08 nonamers 




Pos 


123456789 


score 


107 


1EKAKIGRC 


22 


141 


VEKYKDCDP 


22 


34 


KIAPKEKG1 


21 


1 


MSKKKGLSA 


20 


8 


SAEEKRTRM 


18 


28 


QLKDLEKI A 


17 


85 


KLE V LESQL 


17 


136 


QLKAE VEKY 


17 


161 


VAKEAANRW 


17 


118 


EERTRLAKE 


16 


122 


RL AKELSSL 


16 


121 
i j~ ~> 


1 AKELSSI R 


16 


1 78 

I/O 


SWAKRKFGF 

o vv /a in. iv rv I VJ 1 


16 




KAKIGRCFT 

rv r\ rv i vj iv i_/ i 


15 


175 

1 1 j 


AIKSWAKRK 

/V I ff\ O f 1 IV. IV IV 


15 




HLA-B*1510 nonamers 




Pos 


123456789 


score 


43 


TAMSVKEVL 


1^ 


71 


YWAFPSKAL 


14 


115 


CETEERTRL 


14 


19 


I FSETKD V F 


13 


95 


EGSQKH ASL 


13 


21 


SETKD VFQL 


12 


81 


ARKHKLEVL 


12 


83 


KHKLE VLES 


12 


85 


KLEVLESQL 


12 


119 


ERTRL AKEL 


12 


122 


RLAKELSSL 


12 


129 


SLRDQREQL 


12 


176 


I KSWAKRKF 


12 


8 


SAEEKRTRM 


1 1 


37 


PKEKGITAM 


1 1 


46 


SVKEVLQSL 


11 


78 


ALHARKHKL 


1 1 


79 


LHARKHKLE 


11 


99 


KH ASLQKSI 


11 


187 


EENKIDRTF 


11 


9 


AEEKRTRMM 


10 



SEQ ID NO: 



SEQ ID NO: 



Portion of SEP ID 
NO: 3; each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



183 



511582003440 





HL A-B * 1 5 1 0 nonamers 




Pos 


123456789 


score 


24 


KD VFQLKDL 


10 


66 


GTSN Y Y WAF 


9 


178 


SWAKRKFGF 


9 


193 


RTFGIPEDF 


8 


12 


KRTRMMEIF 


7 


51 


LQSLVDDGM 


7 


155 


IRQANKVAK 


7 



SEQ ID NO: 



HLA-B*2705 nonamers 

Pos 123456789 score 

63 E R I G TSNYY 24 

81 ARKHKLEVL 24 

119 ERTRLAKEL 23 

155 IRQANKVAK 23 

12 KRTRMMEIF 22 

130 LRDQREQLK 22 

182 RKFGFEENK 21 

30 KDLEKIAPK 20 

122 RLAKELSSL 20 

193 RTFGIPEDF 20 

101 ASLQKSIEK 19 

160 KVAKEAANR 19 

174 FAIKSWAKR 18 

37 PKEKGITAM 17 

192 DRTFGIPED 17 

4 KKGLSAEEK 16 

5 KGLSAEEKR 16 
40 KGITAMSVK 16 

113 GRCETEERT 16 

114 RCETEERTR 16 

115 CETEERTRL 16 
133 QREQLKAEV 16 
135 EQLKAEVEK 16 
185 GFEENKIDR 16 
14 TRMMEIFSE 15 
26 VFQLKDLEK 15 
72 WAFPSKALH 15 
85 KLEVLESQL 15 
91 SQLSEGSQK 15 



SEQ ID NO: 



Portion of SEP ID 
NO: 3; each start 
position is 
specified the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



184 



511582003440 





HLA-B*2705 nonamers 




Pos 


123456789 


score 


95 


EGSQKHASL 


15 


121 


TRL AKELSS 


15 


152 


VEEIRQANK 


15 


181 


KRKFGFEEN 


15 


187 


EENK1DRTF 


15 


7 


LSAEEKRTR 


14 


8 


S AEEKRTRM 


14 


19 


IFSETKDVF 


14 


21 


SETKDVFQL 


14 


24 


KDVFQLKDL 


14 


46 


S VKEVLQSL 


14 


66 


GTSN Y Y W AF 


14 


69 


NYYWAFPSK 


14 


75 


PSKALH ARK 


14 


77 


KALHARKHK 


14 


78 


ALHARKHKL 


14 


92 


QLSEGSQKH 


14 


106 


SIEKAKIGR 


14 


123 


LAKELSSLR 


14 


173 


IFAIKSWAK 


14 


175 


AIKS WAKRK 


14 


176 


IKSWAKRKF 


14 


27 


FQLKDLEKI 


13 


43 


TAMSVKEVL 


13 


56 


DDGM VDCER 


13 


62 


CERIGTSN Y 


13 


74 


FPSKALHAR 


13 


97 


SQKHASLQK 


13 


112 


1GRCETEER 


13 


166 


ANRWTDNIF 


13 


168 


RWTDN1FAI 


13 


178 


SWAKRKFGF 


13 


195 


FGIPEDFDY 


13 


16 


M M EI FSETK 


12 


71 


YWAFPSKAL 


12 


76 


SKALH ARKH 


12 


99 


KH ASLQKSI 


12 


126 


ELSSLRDQR 


12 


136 


QLKAEVEKY 


12 



185 



511582003440 





HLA-B*2705 nonamers 




Pos 


1 23456789 


score 


137 


LKAEVEKYK 


12 


167 


NR WTDNIF A 


12 


169 


WTDN I F A 1 K 


12 


183 


KFGFEENKI 


12 



SEQ ID NO: 



HLA-B*2709 nonamers 

Pos 123456789 score 

119 ERTRLAKEL 22 

12 KRTRMMEIF 21 

81 ARKHKLEVL 21 

133 QREQLKAEV 18 

193 RTFGIPEDF 15 

21 SETKDVFQL 14 

113 GRCETEERT 14 

122 RLAKELSSL 14 

24 KDVFQLKDL 13 

85 KLEVLESQL 13 

121 TRLAKELSS 13 

168 RWTDNIFAI 13 

115 CETEERTRL 12 

143 KYKDCDPQV 12 

155 IRQANKVAK 12 

181 KRKFGFEEN 12 

192 DRTFGIPED 12 

196 GIPEDFDYI 12 

18 EIFSETKDV 1 

27 FQLKDLEK1 1 

34 KI APKEKGI 1 

43 TAMSVKEVL 1 

52 QSLVDDGMV 1 

63 ERIGTSNYY 1 

66 GTSNYYWAF 1 

78 ALHARKHKL 1 

99 KH ASLQKSI 1 

129 SLRDQREQL 1 

167 NRWTDNIFA 1 



SEQ ID NO: 



Portion of SEQ ID 
NO: 3; each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



HLA-BM402 nonamers 
Pos 123456789 



score 



SEQ ID NO: 



186 



511582003440 



187 


EENKIDRTF 




21 


SETKD VFQL 


23 


62 


CERIGTSN Y 


21 


115 


CETEERTRL 


21 


153 


EEIRQANKV 


19 


171 


DNIFAIKSW 


18 


63 


ERIGTSN Y Y 


17 


9 


AEEKRTRMM 


16 


78 


ALH ARKHKL 


16 


118 


EERTRLAKE 


16 


119 


ERTRLAKEL 


16 


195 


FGIPEDFDY 


16 


81 


ARKHKLEVL 


15 


117 


TEERTRLAK 


15 


139 


AEVEKYKDC 


15 


168 


RWTDNIFAI 


15 


189 


NKIDRTFGI 


15 


193 


RTFGIPEDF 


15 


10 


EEKRTRMM E 


14 


17 


MEIFSETKD 


14 


24 


KD VFQLKDL 


14 


34 


KI APKEKGI 


14 


38 


KEKGITAMS 


14 


48 


KEVLQSLVD 


14 


66 


GTSN Y Y W AF 


14 


71 


YWAFPSKAL 


14 


94 


S EGSQKH AS 


14 


125 


KELSSLRDQ 


14 


129 


SLRDQREQL 


14 


163 


KEAANRWTD 


14 


166 


ANRWTDNI F 


14 


186 


FEENKIDRT 


14 


32 


LEKIAPKEK 


13 


95 


EGSQKHASL 


13 


107 


IEKAKIGRC 


13 


134 


REQLKAEVE 


13 


165 


A ANRWTDN 1 


13 


176 


IKS WAKRKF 


13 


11 


EKRTRMME1 


12 


12 


KRTRMMEIF 


12 


19 


IFSETKDVF 


12 



25 



Portion of SEP ID 
NO: 3: each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



187 



511582003440 



43 


TAMSVKEVL 


12 


46 


SVKEVLQSL 


12 


85 


KLE VLESQL 


12 


86 


LE VLESQLS 


12 


136 


QLKAEVEKY 


12 


161 


VAKEAANRW 


12 


178 


SWAKRKFGF 


12 



HLA-B*5101 nonamers 

Pos 123456789 score 

43 TAMSVKEVL 22 

57 DGMVDCERI 21 

80 HARKHKLEV 20 

165 AANRWTDNI 20 

27 FQLKDLEKI 17 

36 APKEKGITA 16 

148 DPQVVEEIR 16 

161 VAKEAANRW 16 

8 SAEEKRTRM 15 

147 CDPQVVEEI 15 

157 QANKVAKEA 15 

174 FAIKSWAKR 15 

35 IAPKEKGIT 14 

42 ITAMSVKEV 14 

77 KALHARKHK 14 

123 LAKELSSLR 14 

144 YKDCDPQVV 14 

196 GIPEDFDYI 14 
74 FPSKALHAR 13 
95 EGSQKHASL 13 

183 KFGFEENKI 13 

197 IPEDFDYID 13 
34 KIAPKEKGI 12 
72 WAFPSKALH 12 
104 QKSIEKAKI 12 
138 KAEVEKYKD 12 
153 EEIRQANKV 12 
168 RWTDNIFAI 12 
179 WAKRKFGFE 12 

184 FGFEENKID 12 
189 NKIDRTFGI 12 
11 EKRTRMMEI 11 



SEQ ID NO: 



Portion of SEP ID 
NO: 3; each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



188 



511582003440 





HLA-B*5101 nonamers 




Pos 


123456789 


score 


46 


SVKEVLQSL 


11 


81 


ARKHKLE VL 


11 


99 


KH ASLQKSI 


11 


164 


EAANRWTDN 


11 



SEQ I D NO: 



HLA-A*0201 decamers 
Pos 1234567890 

41 GITAMSVKEV 

77 KALHARKHKL 

42 ITAMSVKEVL 

80 HARKHKLEVL 

121 TRLAKELSSL 
34 KIAPKEKGIT 

46 SVKEVLQSLV 

79 LHARKHKLEV 

45 MSVKEVLQSL 

50 VLQSLVDDGM 

94 SEGSQKHASL 

26 VFQLKDLEKI 

44 AMSVKEVLQS 

53 SLVDDGMVDC 

58 GMVDCERIGT 

92 QLSEGSQKHA 

132 DQREQLKAEV 

146 DCDPQVVEEI 

20 FSETKDVFQL 

38 KEKGITAMSV 

84 HKLEVLESQL 

101 ASLQKSIEKA 

128 SSLRDQREQL 
167 NRWTDNIFAI 
182 RKFGFEENKI 

6 GLSAEEKRTR 

15 RMMEIFSETK 

23 TKDVFQLKDL 

64 R1GTSN YYWA 

70 YYWAFPSKAL 
103 LQKSIEKAKI 
106 SIEKAKIGRC 

129 SLRDQREQLK 



SEQ ID NO: 



score 



23 Portion of SEP ID 
NO: 3; each start 
20 position is specified. 
j g the length of each 
peptide is 10 amino 
'8 acids, the end 
]g position for each 
peptide is the start 
17 position plus nine 
17 

17 

16 

16 

16 

15 

15 

15 

15 

15 

15 

15 

14 

14 

14 

14 

14 

14 

14 

13 

13 

13 

13 

13 

13 

13 

13 

189 



511582003440 





HLA-A*0201 decamers 




Pos 


1 2345 67890 


score 


152 


VEEIRQANKV 


13 


195 


FGIPEDFD YI 


13 


35 


I APKEKGITA 


12 


36 


APKEKG ITAM 


12 


51 


LQSLVDDGM V 


12 


72 


W AFPSKALH A 


12 


102 


SLQKSIEKAK 


12 


122 


RLAKELSSLR 


12 


196 


GIPEDFDYID 


12 




HLA-A*0203 decamers 




Pos 


1 2 3 4567 890 


score 


157 


QANKVAKEAA 




158 


AN K V AKE A AN 


17 


27 


FQLKDLEKI A 


10 


35 


I APKEKG IT A 


10 


64 


RIGTSN Y Y W A 


10 


69 


NYY WAFPSKA 


10 


72 


WAFPSKALH A 


10 


92 


QLSEGSQKH A 


10 


101 


ASLQKSI EKA 


10 


115 


CETEERTRLA 


10 


130 


LRDQREQLK A 


10 


149 


PQ V VEEIRQ A 


10 


153 


EEIRQ AN K VA 


10 


156 


RQANK VA KE A 


10 


166 


ANRWTDNI FA 


10 


171 


DNIFAIKSWA 


10 


1 


MSKKKGLSAE 


9 


28 


QLKDLEKI AP 


9 


36 


APKEKG IT AM 


9 


65 


IGTSNYYWAF 


9 


70 


YYWAFPSKAL 


9 


73 


AFPSKALH AR 


9 


93 


LSEGSQKH AS 


9 


102 


SLQKSIEKAK 


9 


116 


ETEERTRLAK 


9 


131 


RDQREQLK AE 


9 


150 


QV VEE IRQ AN 


9 


154 


E I RQ AN K V AK 


9 



SEQ ID NO: 



19 



SEQ I D NO: 



Portion of SEQ 
ID NO: 3: each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is the 
start position plus 
nine 



190 



511582003440 





HLA-A*0203 decamers 




Pos 


1 234 5 6 7 890 


score 


167 


NRWTDNIFAI 


9 


172 


N1FAIKSWAK 


9 




HLA-A1 decamers 




Pos 


1234567890 


score 


61 


DC ERIGTSN Y 


2 


116 


ETEERTRLAK 


23 


169 


WTDN I F A I KS 


22 


47 


VKEVLQSLVD 


18 


130 


LRDQREQLKA 


18 


135 


EQLKA EVE KY 


18 


20 


FSETKD VFQ L 


16 


62 


CERIGTSNY Y 


15 


93 


LSEGSQKH AS 


15 


146 


DCDPQV VEEI 


15 


190 


KIDRTFG 1PE 


15 


194 


TFG I PEDFD Y 


15 


22 


ETKDVFQLKD 


14 


8 


S A EEKRTRM M 


13 


9 


AEEKRTRMM E 


13 


85 


KLEVLESQLS 


13 


144 


YKDCDPQ V VE 


13 


152 


VEEI RQ AN KV 


13 


16 


MM EIFSETKD 


12 


55 


VDDGMVDCE R 


12 


88 


VLESQLSEGS 


12 


106 


SIEKA K1GRC 


12 


117 


TEERTRL A KE 


12 


120 


RTRLAKELSS 


12 


162 


AKEAANRWTD 


12 



SEQ ID NO: 



HLA-A26 decamers 
Pos 1234567890 

18 EIFSETK'DVF 

87 EVLESQLSEG 

175 AIKS WAKRKF 

135 EQLKAEVEKY 

49 EVLQSLVDDG 

11 EKRTRMMEIF 

25 DVFQLKDLEK 



25 



SEQ ID NO: 



Portion of SEP 
ID NO: 3: each 
start position is 

specified. the 
length of each 

peptide is 10 
amino acids, the 
end position for 



SEQ ID NO: 



score 



29 



Portion of SEP 
ID NO: 3; each 
24 start position is 
23 specified, the 

length of each 
22 peptide is 10 

21 amino acids, the 
end position for 
each peptide is the 
20 start position plus 
nine 

191 



20 



511582003440 



HLA-A26 decamers SEQ I D NO: 



Pos 


1 23 4 5 6 7 890 


score 


22 


ETKD VFQLKD 


19 


42 


ITAMSVKEVL 


19 


116 


ETEERTRLAK 


19 


154 


EIRQANKVAK 


19 


50 


VLQSLVDDGM 


18 


61 


DCERIGTSN Y 


18 


126 


ELSSLRDQRE 


17 


140 


EVEKYKDCDP 


17 


31 


DLEKI APKEK 


16 


36 


APKEKGITAM 


16 


54 


LVDDGM VDCE 


16 


65 


1GTSN YYWAF 


16 


106 


SIEKAKIGRC 


16 


192 


DRTFGIPEDF 


16 


194 


TFGIPEDFD Y 


16 


13 


RTRMMEIFSE 


15 


41 


GITAMSVKEV 


15 


45 


MSVKEVLQSL 


15 


59 


MVDCERIGTS 


15 


118 


EERTRLAKEL 


15 


46 


SVKEVLQSLV 


14 


53 


SLVDDGMVDC 


14 


64 


RIGTSN Y Y W A 


14 


121 


TRLAKELSSL 


14 


146 


DCDPQV VEE1 


14 


150 


QVVEEIRQAN 


14 


151 


VVEEIRQANK 


14 


193 


RTFG1PEDFD 


14 





HLA-A3 decamers 




SEQ ID NO: 


Pos 


1 234567890 


score 




154 


EIRQANKVAK 


26 


Portion ofSEO 
ID NO: 3: each 


129 


SLRDQREQLK 


25 


start position is 


136 


QLKAEVEKYK 


25 


specified, the 
length of each 


151 


VVEEIRQANK 


24 


peptide is 10 


25 


DVFQLKDLEK 


23 


amino acids, the 
end position for 
each peptide is the 


102 


SLQKSIEK AK 


22 


122 


RLA KELSSLR 


22 


start position plus 
nine 


31 


DLEKI APKEK 


21 



192 



511582003440 



HLA-A3 decamers 



Pos 


1234567890 


172 


NIFAIKS WAK 


6 


GLSAEEKRTR 


90 


ESQLSEGSQK 


3 


KKKGLS AEEK 


15 


RMMEIFSETK 


134 


REQLK AE V EK 


39 


EKGITAMSVK 


111 


KIGRCETEER 


168 


RWTDNIFAI K 


68 


SNY Y WAFPSK 


160 


KVAKEAANRW 


190 


KIDRTFG 1PE 


18 


EIFSETKD VF 


34 


KIAPKEKG IT 


46 


SVKEVLQSLV 


53 


S L V DDGM V DC 


87 


EVLESQLSEG 


96 


GSQKH AS LQK 


116 


ETEERTRLAK 


174 


FA1KSWAKRK 


175 


AIKS WAKRKF 


28 


QLKDLEKI AP 


59 


M VDCERIGTS 


78 


ALH ARKHKLE 


150 


QVV EEIRQ AM 


29 


LKDLEKI APK 


76 


SKALHARKHK 


1 ft 1 


N. tx K. r u £ c c IX IS. 


64 


R1GTSNYYWA 


74 


FPSKALH ARK 


85 


KLE VLESQLS 


92 


QLSEGSQKHA 


120 


RTRL AKELSS 


125 


KELSSLRDQR 




HLA-B*0702 decamers 


Pos 


1234567890 


36 


APKEKGITAM 


74 


FPSKALHARK 


80 


HARKHKLEVL 



SEQ ID NO: 

score 
21 
20 
20 
19 
19 
19 
18 
18 
18 
17 
17 
17 
16 
16 
16 
16 
16 
16 
16 
16 
16 
15 
15 
15 
15 
14 
14 
14 
13 
13 
13 
13 
13 
13 

SEQ ID NO: 

score 

20 Portion of SEP 
ID NO: 3: each 
14 start position is 
14 specified, the 
length of each 
peptide A d4 
amino acids, the 
end position for 
each peptide is the 
start position plus 
nine 



511582003440 





HLA-B*0702 decamers 




Pos 


1234567890 


score 


42 


1TAMSVKEVL 


13 


114 


RCETEERTRL 


13 


118 


EERTRLAKEL 


13 


70 


Y Y WAFPSKAL 


12 


94 


SEGSQKHASL 


12 


20 


FSETKDVFQL 


11 


23 


TKDVFQLKDL 


11 


45 


MS V KE VLQSL 


11 


77 


K ALH ARKHKL 


11 


121 


TRLAKELSSL 


1 1 


128 


SSLRDQREQL 


11 


166 


ANRWTDNIFA 


1 1 


84 


HKLEVLESQL 


10 


108 


EKAKIGRCET 


10 


148 


DPQV VEEIRQ 


10 




HLA-BM402 decamers 




Pos 


1 2 3 4 567890 


score 


118 


EERTRLAKEL 


2 


186 


FEENKIDRTF 


23 


10 


EEKRTRMME1 


21 


62 


CERIGTSN Y Y 


21 


94 


SEGSQKHASL 


21 


153 


EEI RQ ANK V A 


19 


17 


M EIFSETKDV 


16 


63 


ERIGTSN Y Y W 


16 


18 


EIFSETKD VF 


15 


33 


EKI APKEKG 1 


15 


128 


SSLRDQREQL 


15 


135 


EQLKAEVEKY 


15 


165 


AANRWTDNIF 


15 


167 


NRWTDNI FAI 


15 


170 


TDNIFAIKSW 


15 


175 


AI KSWAKRKF 


15 


195 


FG 1PEDFD Y I 


15 


9 


AE EKRTRMM E 


14 


23 


TKDVFQLKDL 


14 


48 


KEVLQSLVDD 


14 


70 


YY WAFPSKAL 


14 


77 


KALH ARKHKL 


14 



SEQ ID NO: 



SEQ ID NO: 



NO: 3; each start 
position is 
specified, the 
length of each 
peptide is 10 amino 
acids, the end 
position for each 



194 



511582003440 



HLA-B*4402 decamers SEQ I D NO: 



Pos 


1234567890 


score 


125 


KELSSLRDQR 


14 


11 


EKRTRMMEIF 


13 


20 


FSETKD VFQL 


13 


21 


SETKD VFQLK 


13 


38 


KEKGITAMSV 


13 


115 


CETEERTRL A 


13 


117 


TEERTRLAKE 


13 


139 


AEVEKYKDCD 


13 


146 


DCDPQ V VEEI 


13 


152 


VEEIRQ ANKV 


13 


160 


KVAKEAANRW 


13 


182 


RKFGFEENKI 


13 


187 


EENK1DRTFG 


13 



Class I nonamer analysis of amino acids 85-126 

(KLEVLESQDPGCCFHEIIKVSYYRKFWLGAVAHACNPSTLGG) of 121 PI Fl splice variant la. Listed are 
5 those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the analysis 
of the full length 12 1 PI Fl base peptide sequence. 



10 



HLA-A*0201 nonamers 
Pos 123456789 
96 CCFHE1IKV 
116 AHACNPSTL 
107 YRKF WLGAV 



110 FWLGAVAHA 

HLA-A1 nonamers 

Pos 123456789 

98 FHEIIKVSY 

91 SQDPGCCFH 

99 HEIIKVSYY 

88 VLESQDPGC 

85 KLEVLESQD 

118 ACNPSTLGG 



SEQ ID NO: 



score 



| 7 Portion of SEP ID 
NO: 5: each start 
16 position is 

15 specified, the 

length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
15 position plus eight 



SEQ ID NO: 



score 



18 
16 
14 
11 

11 



25 Portion of SEQ ID 
NO: 5; each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



195 



511582003440 



HLA-A26 nonamers 

Pos 123456789 

87 EVLESQDPG 

100 EIIK VSYYR 
99 HEIIKVSYY 
90 ESQDPGCCF 

101 IIKVSYYRK 



SEQ ID NO: 



102 



IKVSYYRKF 



HLA-A3 nonamers 

Pos 123456789 

101 IIKVSYYRK 

85 KLEVLESQD 

109 KFWLGAVAH 

111 WLGA VAHAC 

100 EI]K VSYYR 

99 HEIIKVSYY 

103 KVSYYRKFW 

108 RKFWLGAVA 

114 AVAHACNPS 

87 EVLESQDPG 

98 FHEIIKVSY 

116 AHACNPSTL 



score 



19 Portion of SEP ID 
NO: 5; each start 
19 position is 

lg specified, the 
length of each 
17 peptide is 9 amino 
17 acids, the end 
position for each 
peptide is the start 
16 position plus eight 



SEQ ID NO: 



score 
21 

19 
18 



Portion of SEQ ID 
NO: 5: each start 
position is 
specified, the 
length of each 
17 peptide is 9 amino 
16 acids, the end 
position for each 
peptide is the start 



14 



14 position plus eight 

14 

14 

13 

13 

12 





HLA-B*0702 nonamers 




Pos 


123456789 


score 


93 


DPGCCFHEI 


1 


116 


AHACNPSTL 


13 


90 


ESQDPGCCF 


11 


106 


YYRKFWLGA 


11 


104 


VSYYRKFWL 


10 


108 


RKFWLGAVA 


10 


110 


FWLGAVAHA 


10 



SEQ ID NO: 



NO: 5: each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 



HLA-B*08 nonamers SEQ ID NO: 

Pos 123456789 score 

104 VSYYRKFWL 20 P °* io " of S ^ " ? 

NO: 5: each start 

position is 

specified, the 

length of each 

peptide is 9 amino 

acids. Ihfgg j 

position for each 

peptide is the start 

position plus eight 



511582003440 





Ul A _D * AO nnngmprc 
nLn"D WO IlUIld.JIlCI b 




Pos 


1 A J H J O / O 7 


score 


101 


1 1 IV Vol I IN. IV 


16 




HLA-B*1510 nonamers 




Pos 


123456789 


score 


116 


AH ACNPSTL 


2< 


98 


FHE! 1KVSY 


14 


104 


VSYYRKFWL 


11 


i m 
1U2 


I 1^ \/ O V V D V I? 

IKVoY Y K Is. r 


I U 


on 




Q 




HLA-B*2705 nonamers 




Pos 


123456789 


score 


100 


EIIKVSYYR 


1 


101 


I IK VS Y YRK 


15 


109 


KFWLGA V AH 


15 


95 


GCCFHEI IK 


14 


103 


IKVSYYRKF 


14 


QO 

yy 


n l i i i\ v o i i 


i ^ 


1 




1 1 
i j 


1 1£ 


n n n ^ IN r j 1 Lj 


1 ^ 


no 

70 


FHPIIkTV^V 
rriciiivvo ■ 


1? 




riLA-t>*z /uy nonamers 




Pos 


123456789 


score 


107 


YRKFWLGAV 


1 


104 


VSYYRKFWL 


12 


102 


IKVSYYRKF 


11 


116 


AH ACN PSTL 


11 



SEQ ID NO: 



SEQ ID NO: 



Portion of SEQ ID 
NO: 5: each start 
position is 
specified. the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



SEQ ID NO: 



NO: 5: each start 
position is 
specified. the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



SEQ ID NO: 



NO: 5; each start 
position is 
specified, the 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 
position plus eight 



HLA-B*4402 nonamers 
Pos 123456789 
99 HEIIKVSYY 



score 



SEQ ID NO: 



24 Portion of SEP ID 
NO: 5: each start 
position is 
specified.Qffe 
length of each 
peptide is 9 amino 

acids, the end 
position for each 
peptide is the start 



511582003440 





HLA-B*4402 nonamers 




Pos 


123456789 


score 


116 


AH ACNPSTL 


16 


103 


KVSYYRKFW 


15 


90 


ESQDPGCCF 


13 


89 


LESQDPGCC 


12 


98 


FHE1IKVS Y 


12 


102 


I KVSYYRKF 


12 



HL A-B *5 1 0 1 nonamers SEQ ID NO: 

Pos 123456789 score 

93 DPGCCFHEI 25 P °* io " of S U EQ 10 

NO: 5: each start 

94 PGCCFHEII 16 position is 

95 CCFHEI1KV 13 specified, the 

length of each 

115 VAHACNPST 13 pep( ide is 9 amino 
113 GAVAHACNP 12 acids, the end 

position for each 

104 VSYYRKFWL 11 4>tide is the start 

107 YRKFWLGAV 11 position plus eight 

117 HACNPSTLG 11 

116 AHACNPSTL 9 



Class 1 decamer analysis of amino acids 84-126 

(HKLEVLESQDPGCCFHEI1KVSY YRKFWLGAV AH ACNPSTLGG) of 121P1F1 splice variant la. Listed are 
those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the analysis 
of the full length 121P1F1 base peptide sequence. 



10 





HLA-A*0201 decamers 




Pos 


1 2345 67890 


score 


106 


Y YRKFWLGAV 


17 


115 


V AH ACNPSTL 


17 


94 


GCCFHEIIKV 


16 


114 


AVAH ACNPST 


15 


103 


K VSYYRKFWL 


14 


92 


QDPGCCFHEI 


13 


109 


KFWLG A V AH A 


12 


111 


WLGAVAHACN 


12 




HLA-A*0203 decamers 




Pos 


1234567890 


score 


107 


YRKFWLGAVA 


18 



SEQ ID NO: 



Portion of SEQ 
ID NO: 5; each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



SEQ ID NO: 

Portion of SEP 
ID NO: 5: each 
start position is 



spec 



, the 



length of each 
peptide is 10 
amino acids, the 
end position for 



511582003440 



HLA-A*0203 decamers 

Pos 1234567890 score 

119 KFWLGAVAHA 18 

105 S YYRKFWLGA 10 

106 YYRKFWLGAV 9 
108 RKFWLGAVAH 9 
110 FWLGAVAHAC 9 

HLA-A1 decamers 

Pos 1234567890 score 

98 FHEIIKVSYY 27 

91 SQDPGCCFHE 16 

97 CFHEIIKVSY 15 



88 VLESQDPGCC 12 



HLA-A26 decamers 

Pos 1234567890 score 

101 1IKVSY YRKF 26 

100 EIIKVSYYRK 24 

87 EVLESQDPGC 20 

97 CFHEIIKVSY 20 

103 KVSYYRKFWL 18 



98 FHEIIKVSYY 15 

HLA-A3 decamers 

Pos 1234567890 score 

100 EIIKVSYYRK 21 

108 RKFWLGAVAH 16 

114 AVAHACNPST 16 

101 IIKVSYYRKF 15 

111 WLGAVAHACN 15 

103 KVSYYRKFWL 14 

85 KLEVLESQDP 13 

87 EVLESQDPGC 13 

97 CFHEIIKVSY 13 



SEQ ID NO: 



SEQ I D NO: 

Portion of SEQ 
ID NO: 5: each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



SEQ ID NO: 

Portion of SEP 
ID NO: 5: each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



SEQ ID NO: 

Portion of SEQ 
ID NO: 5: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



199 



511582003440 



HLA-B*0702 decamers SEQ ID NO: 

Pos 1234567890 score 

93 DPGCCFHEII 17 ^™°S SE ? 

ID NO: 5; each 

103 KVSYYRKFWL 13 start position is 

115 VAHACNPSTL 11 specified, the 

length of each 

106 YYRKF WLGAV 10 peP tide is 10 

amino acids, the 
end position for 
each peptide is 
the start position 

114 AVAHACNPST 10 plus nine 



Pos 

89 
99 



102 



HLA-B*4402 decamers 
1234567890 
LESQDPGCCF 
HEIIKVS YYR 



I KVSYYRKFW 



score 
21 
13 



13 



SEQ ID NO: 



Portion of SEP 
ID NO: 5: each 
start position is 
specified. the 
leneth of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



Class I nonamer analysis of amino acids 1-14 (MKCKMELSEGSQKH) of 121P1F1 splice variant lb. Listed are 
those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the analysis 
of the full length 121P1F1 base peptide sequence. 



Pos 



HLA-A1 nonamers 

123456789 



score 



Pos 



KMELSEGSQ 

HLA-A26 nonamers 
123456789 



ELSEGSQKH 



10 



score 



18 



SEQ ID NO: 

Portion of SEQ 
ID NO: 7: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 

Portion of SEP 
ID NO: 7: each 
start position is 
specified, the 
length of each 



200 



511582003440 



Pos 



HLA-A26 nonamers 
123456789 



Pos 



HLA-A3 nonamers 

123456789 
MELSEGSQK 



score 



score 
21 



ELSEGSQKH 



17 



SEQ ID NO: 

peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 

Portion of SEP 
ID NO: 7; each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



Pos 



HLA-B*2705 nonamers 
123456789 
MELSEGSQK 



Pos 



ELSEGSQKH 

HLA-B*4402 nonamers 
123456789 



MELSEGSQK 



score 
15 



14 



score 



12 



SEQ ID NO: 

Portion of SEP 
IDNP: 7: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 

Portion of SEP 
ID NO: 7; each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



Class 1 decamer analysis of amino acids 1-15 (MKCKMELSEGSQKHA) of 121P1F1 splice variant lb. Listed are 
those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the analysis 
of the full length 121P1F1 parental peptide sequence. 



201 



511582003440 



HLA-A*0201 decamers 
Pos 1234567890 score 

Portion of SEP 
ID NO: 7; each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
6 ELSEGSQKHA 12 plus nine 



Pos 



HLA-A*0203 decamers 
1 2 3 4567890 



score 



Pos 



6 ELSEGSQKHA 

HLA-A26 decamers 

1234567890 



10 



score 



Pos 



ELSEGSQKHA 

HLA-A3 decamers 

1234567890 



17 



score 



KM ELSEGSQK 



23 



SEQ ID NO: 

Portion of SEP 
ID NO: 7: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 

Portion of SEP 
IDNP: 7: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 

Portion of SEP 
ID NP: 7: each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



202 



511582003440 



Class 1 nonamer analysis of amino acids 1 10-122 (AKIGRCETAKQIK) of 121 PI Fl splice variant 2. Listed are 
those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the analysis 
of the full length 121P1F1 parental peptide sequence. 



Pos 



HLA-A1 nonamers 
123456789 



114 



Pos 

111 
110 



RCETAKQ1K 

HLA-A3 nonamers 

123456789 

KIGRCETA K 
AKIGRCETA 



score 



10 



score 



26 
14 



114 RCETAKQIK 



14 



SEQ ID NO: 

Portion of SEP 
ID NO: 9; each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 

SEQ ID NO: 

Portion of SEP 
ID NP: 9: each 
start position is 
specified. the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



HLA-B*0702 nonamers 
Pos 123456789 



110 



AKIGRCETA 



score 



10 



SEQ ID NO: 

Portion of SEP 
1DNP: 9: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



HLA-B*2705 nonamers 

Pos 123456789 score 

113 GRCETAKQI 22 

114 RCETAKQIK 15 



SEQ ID NO: 

Portion of SEP 
IDNP: 9: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end2flsition for 
each peptide is 
the start position 
plus eight 



511582003440 



HLA-B*2705 nonamers 
Pos 123456789 score 

111 KIGRCETAK 14 



SEQ ID NO; 



Pos 



113 



HLA-B*2709 nonamers 
123456789 



score 



GRCET AKQI 



23 



SEQ ID NO: 

Portion of SEP 
ID NO: 9; each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



HLA-B*4402 nonamers 

HLA-B*4402 nonamers 
Pos 123456789 



score 



113 



GRCETAKQI 



12 



SEQ ID NO: 

Portion of SEP 
ID NO: 9: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



Pos 



HLA-B*5101 nonamers 
123456789 



score 



113 



GRCETAKQI 



15 



SEQ ID NO: 

Portion of SEP 
1DNP: 9: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



Class 1 decamer analysis of amino acids 109-122 (KAKIGRCETAKQIK) of 121P1F1 splice variant 2. Listed are 
those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the analysis 
of the full length 121P1F1 base peptide sequence. 



HLA-A*0201 decamers SEQ ID NO: 

Pos 1234567890 score 

204 



511582003440 



Portion of SEP 
ID NO: 9: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids. 

the end 
position for 
each peptide is 

the start 
position plus 

111 KIGRCETAKQ 13 nine 



HLA-A*0203 decamers SEQ ID NO: 

Pos 1234567890 score 

109 KAKIGRCETA 10 Portion of SEQ 

~~ ~ ID NO: 9: each 

start position is 
specified, the 
length of each 
peptide is 10 
amino acids. 

the end 
position for 
each peptide is 

the start 
position plus 

110 AKIGRCETAK 9 nine 



10 



HLA-A3 decamers SEQ ID NO: 

Pos 1234567890 score 

110 AKIGRCETAK 20 Portion of SEQ 

ID NO: 9: each 
start position is 

specified, the 
length of each 

peptide is 10 

amino acids, 
the end 

position for 
each peptide is 
the start 

position plus 

111 KIGRCETAKQ 17 nine 

Class I nonamer analysis of amino acids 148-164 (DPQVVEEIHNIFA1KSW) of 121P1F1 splice variant 3. Listed 
are those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from the 
analysis of the full length 121P1F1 base peptide sequence. 

HLA-A*0201 nonamers SEQ ID NO: 

Pos 123456789 score 

Portion of SEP 
ID NO: II; 

150 QVVEEIHNI 19 each start 

205 



511582003440 



Pos 



HLA-A*0201 nonamers 
123456789 



score 



SEQ ID NO: 

position is 
specified, the 
length of each 
peptide is 9 
amino acids. 

the end 
position for 
each peptide is 

the start 
position plus 
eight 



Pos 

152 



HLA-A1 nonamers 
123456789 

VEEIHNIFA 



score 
16 



151 



VVEEIHNIF 



11 



SEQ ID NO: 

Portion of SEQ 
ID NO: 113; 
each start 
position is 
specified, the 
length of each 
peptide is 9 
amino acids, 

the end 
position for 
each peptide is 
the start 
position plus 
eight 



Pos 
151 
154 
150 



HLA-A26 nonamers 
123456789 
VVEEIHNIF 
E I HN I F A I K 
QV VEEIHNI 



score 
22 
21 
17 



153 



EEI HNIFAI 



13 



SEQ ID NO: 

Portion of SEP 
ID NO: II: 
each start 
position is 
specified, the 
length of each 
peptide is 9 
amino acids. 

the end 
position for 
each peptide is 
the start 
position plus 
eight 



HLA-A3 nonamers SEQ ID NO: 

Pos 123456789 score 

154 EI HNIFAI K 22 Portion of SEQ 

151 VVEEIHNIF 15 IDN ° : " ; 

— each start 

2Q#tion is 

specified. the 

length of each 

peptide is 

9amino acids. 

the end 



511582003440 



HLA-A3 nonamers 
Pos 123456789 score 

150 QVVEEIHNI 13 



SEQ ID NO: 



Pos 



HLA-B*0702 

nonamers 
123456789 



score 



148 DPQVVEE1H 



10 



SEQ ID NO: 



Portion of SEP 
ID NO: 1 1: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



Pos 

155 



HLA-B*1510 

nonamers 
123456789 
1HNIFAIKS 



score 
12 



151 



V VEE1HNI F 



SEQ ID NO: 



Portion of SEQ 
ID NO: 11: each 
start position is 
specified, the 
length of each 
peptide is 9amino 

acids, the end 
position for each 

peptide is the 
start position plus 
eight 



Pos 
150 
151 



HLA-B*2705 

nonamers 

123456789 
QVVEEIHNI 
V VEEIHNI F 



score 
14 
13 



154 



EIHNIFAIK 



12 



SEQ ID NO: 



Portion of SEQ 
ID NO: 11: each 
start position is 
specified, the 
length of each 
peptide is 9 
amino acids, the 
end position for 
each peptide is 
the start position 
plus eight 



Pos 

153 



HLA-B*4402 

nonamers 
123456789 
EE I HN I FA I 



score 
29 



SEQ ID NO: 



Portion of SEQ 
ID NO: 1 1: each 
start positiojyj.s, 
specified, fH^ / 
length of each 
peptide is 9 
amino acids, the 
end position for 



511582003440 



HLA-B*4402 SEQ ID NO: 

nonamers 

Pos 123456789 score 

156 HNIFAIKSW 18 

150 QVVEE1HNI 12 

151 VVEEIHNIF 12 



HLA-B*5101 SEQ ID NO: 

nonamers 

Pos 123456789 score 
148 DPQVVEE1H 16 Portion of SEQ 

150 QVVEEIHNI 13 IPNQ: cach 

^ start position is 

specified. the 

length of each 

peptide is 9 

amino acids, the 

end position for 

each peptide is 

the start position 

153 EEIHNIFAI 11 plus eight 

Class I decamer analysis of amino acids 147-165 (CDPQVVEEIHNIFAIKSWA) of 121P1F1 splice variant 3. 
5 Listed are those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from 
the analysis of the full length 121P1F1 base peptide sequence. 



Pos 



HLA-A*0201 
decamers 
1234567890 



score 



152 V EEIHNIFAI 



13 



SEQ ID NO: 



Portion of SEP 
ID NO: 1 1; each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 



10 



Pos 
151 
156 



152 



HLA-A*0203 decamers 
1234567890 
VVEEIHN1FA 
HNIFAIKSWA 



VEEIHNIFAI 



score 
10 
10 



SEQ ID NO: 

Portion of SEP 
ID NO: 1 I: cach 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

208 



511582003440 



Pos 



151 



Pos 

150 



154 



Pos 
150 



153 



Pos 



HLA-A1 decamers 
1234567890 



VVEE1HNIFA 

HLA-A26 decamers 
123 4567 89 0 
QV VEEIHNIF 



E1HNIFAIKS 

HLA-A3 decamers 
1234567890 
QVVEEIHNIF 



E E I HNI F A I K 

HLA-B*0702 decamers 
1 2 3 456 7 890 



score 



16 



score 

22 



17 



score 
17 



16 



score 



148 



DPQV VEEIHN 



10 



SEQ ID NO: 



Portion of SEP 
IP NO: II; each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ I D NO: 

Portion of SEQ 
ID NO: 1 1: each 
start position is 
specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 

Portion of SEQ 
ID NO: 1 I; each 
start position is 

specified, the 
length of each 

peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

SEQ ID NO: 



Portion of SEQ 
ID NO: II: each 
start position is 
specified. the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 
plus nine 

209 



511582003440 



HLA-BM402 decamers SEQIDNO: 
Pos 1234567890 score 

152 VEEIHNIFAI 23 Portion ofSEQ 

153 EEIHNIFAIK 16 iDNQ: n . ;each 

start position is 

specified, the 
length of each 
peptide is 10 
amino acids, the 
end position for 
each peptide is 
the start position 

155 IHN1FAIKSW 15 plus nine 



210 



511582003440 



Table XXVII. 

MHC Class II analysis of 121 PI Fl for selected alleles. Listed are scores that fall within the top 50% (rounded 
up) of all scores for the selected allele. 





HLA-DRB1*0101 15-mers 




Pos 


123456789012345 


score 


83 


KHKLE VLESQLSEGS 


31 


86 


LEVLESQLSEGSQKH 


30 


26 


VFQLKDLEKI APKEK 


26 


48 


KEVLQSLVDDGMVDC 


26 


67 


TSN Y YWAFPSKALH A 


25 


68 


SNY Y WAFPSKALHAR 


25 


141 


VEK YKDCDPQVVEEI 


25 


39 


EKGITAMSVKEVLQS 


24 


29 


LKDLEK1 APKEKG1T 


23 


36 


APKEKG ITAMSVKEV 


23 


44 


AMSVKEVLQSLVDDG 


23 


167 


NRWTDN1FAIKSWAK 


23 


13 


RTRMMEIFSETKDVF 


20 


24 


KDVFQLKDLEKI APK 


20 


150 


QVVEEIRQANKVAKE 


20 


170 


TDN IFAI KSWAKRKF 


20 


186 


FEENKIDRTFGIPED 


20 


73 


AFPSKALHARKHKLE 


19 


80 


HARKHKLEVLESQLS 


19 


116 


ETEERTRLAKELSSL 


19 


173 


1FAIKS WAKRKFGFE 


19 


33 


EKI APKEKG 1TAMSV 


18 


138 


KAEVEKYKDCDPQVV 


18 


158 


ANKVAKEAANRWTDN 


18 


1 


MSKKKGLSAEEKRTR 


17 


15 


RMMEIFSETKDVFQL 


17 


42 


ITAMS VKEVLQSLVD 


17 


65 


IGTSN Y Y W AFPSKAL 


17 


90 


ESQLSEGSQKHASLQ 


17 


101 


ASLQKSIEKAKIGRC 


17 


117 


TEERTRLAKELSSLR 


17 


154 


EIRQANKVAKEAANR 


17 


155 


IRQANKVAKEAANRW 


17 


16 


MMEIFSETKDVFQLK 


16 


23 


TKD VFQLKDLEKI AP 


16 


35 


IAPKEKGITAMSVKE 


16 



SEQ ID NO: 



Portion of SEP ID 
NO: 3 each start 
position is specified, 
the length of each 
peptide is 1 5 amino 
acids, the end 
position for each 
peptide is the start 
position plus 
fourteen 



211 



511582003440 





HLA-DRB1*0101 15-mers 




Pos 


1 234567 890 1 2345 


score 


57 


DGMVDCERIGTSNYY 


16 


62 


CERIGTSNYYWAFPS 


16 


70 


YYWAFPSKALHARKH 


16 


113 


GRCETEERTRLAKEL 


16 


120 


RTRLAKELSSLRDQR 


16 


124 


AKELSSLRDQREQLK 


16 


127 


LSSLRDQREQLKAEV 


16 


130 


LRDQREQLKAEVEKY 


16 


131 


RDQREQLKAEVEKYK 


16 


188 


ENK1DRTFGIPEDFD 


16 


190 


Kl DRTFG I PEDFD Y 1 


16 


6 


GLSAEEKRTRMMEIF 


15 


10 


EEKRTRMMEIFSETK 


15 


49 


EVLQSLVDDGMVDCE 


15 


34 


L V U U U 1V1 V U L c K 1 u 1 o 


1 c 
I 0 




Klu Kt t 1 cbK 1 K L> 


1 c 


1 O 1 




1 c 
1 3 


1 ^ 1 


V V C L I i\ y /\ n I\ V A N L /A 


1 < 
1 J 




HLA-DRB 1*0301 (DR17) 15-mers 




Pos 


1 234 5 6 7 890 1 23 4 5 


score 


173 


IFAIKS WAKRKFGFE 


27 


126 


ELSSLRDQREQLKAE 


26 


16 


MME1FSETKDVFQLK 


25 


51 


LQSLVDDGMVDCERI 


23 


44 


AMSVKEVLQSLVDDG 


20 


148 


DPQVVEEIRQANKVA 


20 


25 


DVFQLKDLEKIAPKE 


19 


26 


VFQLKDLEKIAPKEK 


19 


127 


LSSLRDQREQLKAEV 


19 


149 


PQVVEE1RQANKVAK 


19 


152 


VEEIRQANKVAKEAA 


19 


14 


TRMMEIFSETKDVFQ 


18 


32 


LEKI APKEKGITAMS 


18 


56 


DDGMVDCERIGTSNY 


18 


82 


RKHKLE VLESQLSEG 


18 


90 


ESQLSEGSQKH ASLQ 


18 


142 


EKYKDCDPQVVEE1R 


18 


4 


KKGLSAEEKRTRMME 


17 



SEQ ID NO: 



SEQ ID NO: 



Portion of SEP ID 
NO: 3; each start 
position is specified, 
the length of each 
peptide is 15 amino 
acids, the end 
position for each 
peptide is the start 
position plus 
fourteen 



212 



511582003440 





HLA-DRB 1*0301 (DR17) 15-mers 




Pos 


1 234567 890 1 2345 


score 


75 


PSKALHARKHKLEVL 


17 


100 


HASLQKSIEKAKIGR 


17 


134 


REQLKAEVEKYKDCD 


17 


55 


VDDGM VDCERIGTSN 


16 


40 


KGITAMSVKEVLQSL 


15 


112 


IGRCETEERTRLAKE 


15 


181 


KRKFGFEENK1DRTF 


15 


175 


AIKSWAKRKFGFEEN 


14 


19 


IFSETKDVFQLKDLE 


13 


47 


VKEVLQSLVDDGMVD 


13 


83 


KHKLE VLESQLSEGS 


13 


85 


KLEVLESQLSEGSQK 


13 



SEQ IP NO: 



HLA-DRB 1*0401 (DR4Dw4) 15-mers 

Pos 123456789012345 score 

68 SNYY WAFPSKALH AR 28 

13 RTRMMEIFSETKDVF 26 

44 AMSVKE VLQSLVDDG 26 

83 KHKLEVLESQLSEGS 26 

148 DPQVVEEIRQANKVA 26 

149 PQVVEEIRQANKVAK 26 
170 TDN1FAIKS WAKRKF 26 
67 TSN Y YWAFPSKALHA 22 
181 KRKFGFEENKIDRTF 22 
23 TKDVFQLKDLEKIAP 20 
29 LKDLEK1 APKEKGIT 20 
48 KEVLQSLVDDGMVDC 20 

56 DDGMVDCERIGTSNY 20 

57 DGMVDCER1GTSNYY 20 
86 LEVLESQLSEGSQKH 20 
90 ESQLSEGSQKHASLQ 20 
120 RTRLAKELSSLRDQR 20 
134 REQLKAEVEKYKDCD 20 
152 VEEIRQANKVAKEAA 20 
5 KGLSAEEKRTRMMEI 18 
72 WAFPSKALHARKHKL 18 
106 SIEKAKIGRCETEER 18 

112 IGRCETEERTRLAKE 18 

113 GRCETEERTRL AKEL 18 

213 



SEQ ID NO: 



Portion of SEQ ID 
NO: 3; each start 
position is specified, 
the length of each 
peptide is 1 5 amino 
acids, the end 
position for each 
peptide is the start 
position plus 
fourteen 



511582003440 





HLA-DRB 1*0401 (DR4Dw4) 15-mers 




Pos 


1 2 3456 7 89 0 1 23 45 


score 


126 


ELSSLRDQREQLKAE 


18 


159 


NKVAKEA ANRWTDNI 


18 


186 


FEENKIDRTFGIPED 


18 


17 


MEIFSETKDVFQLKD 


16 


141 


VEKYKDCDPQVVEEI 


16 


166 


ANRWTDNIFA1KSWA 


16 


183 


KFGFEENKIDRTFGI 


16 


4 


KKGLSAEEKRTRMME 


14 


14 


TRMMEIFSETKDVFQ 


14 


16 


MMEIFSETKDVFQLK 


14 


26 


VFQLKDLEKI APKEK 


14 


39 


EKGITAMSVKEVLQS 


14 


51 


LQSLVDDGMVDCERI 


14 


62 


CERIGTSN Y Y W AFPS 


14 


100 


HASLQKSIEKAKIGR 


14 


104 


QKSI EK AKIGRCETE 


14 


109 


KAKIGRCETEERTRL 


14 


124 


AKELSSLRDQREQLK 


14 


127 


LSSLRDQREQLKAE V 


14 


158 


ANKVAKEAANRWTDN 


14 



SEQ ID NO: 





HLA-DRB1*1101 15-mers 




Pos 


1 2 3456 7 890 1 23 4 5 


score 


26 


VFQLKDLEKI APKEK 


26 


117 


TEERTRLAKELSSLR 


23 


83 


KHKLE VLESQLSEGS 


20 


155 


IRQANKVAKEAANRW 


20 


185 


GFEENKIDRTFGIPE 


20 


69 


NYYWAFPSKALH ARK 


19 


67 


TSNYYWAFPSKALHA 


17 


16 


MMEIFSETKDVFQLK 


16 


173 


IFAIKS W AKRKFGFE 


16 


4 


KKGLSAEEKRTRMME 


15 


30 


KDLEKI APKEKGITA 


15 


32 


LEKIAPKEKGITAMS 


15 


76 


SKALHARKHKLEVLE 


15 


97 


SQKHASLQKSIEKAK 


15 


101 


ASLQKSIEKAKIG RC 


15 


135 


EQLKAEVEKYKDCDP 


15 



SEQ ID NO: 



Portion of SEP IP 
NO: 3: each start 
position is specified, 
the length of each 
peptide is 15 amino 
acids, the end 
position for each 
peptide is the start 
position plus 
fourteen 



214 



511582003440 





HLA-DRB1M101 15-mers 




Pos 


1 234 567 890 1 2345 


score 


10 


EEKRTRMMEIFSETK 


14 


39 


EKGITAMSVKEVLQS 


14 


48 


KEVLQSLVDDGMVDC 


14 


56 


DDGMVDCER1GTSNY 


14 


91 


SQLSEGSQKHASLQK 


14 


106 


SIEKAKIGRCETEER 


14 


124 


AKELSSLRDQREQLK 


14 


148 


DPQVVEEIRQANKVA 


14 


152 


VEEIRQANKVAKEAA 


14 


169 


WTDNIFA1KSWAKRK 


14 


174 


FAIRS WAKRKFGFEE 


14 


23 


TKDVFQLKDLEKIAP 


13 


42 


ITAMSVKEVLQSLVD 


13 


44 


AMSVKEVLQSLVDDG 


13 


166 


ANRWTDNIFAIKSWA 


13 


167 


NRWTDN1FAIKS WAK 


13 


170 


TDN1FAIKSWAKRKF 


13 



Class II 15-mer analysis of amino acids 80-126 

(HARKHKLEVLESQDPGCCFHE1IKVSYYRKFWLGAVAHACNPSTLGG) 

of 121 PI Fl splice variant la. Listed are those alleles and peptides in which the score falls within the top 50% 
5 (rounded up) of the scores from the analysis of the full length 12 1 PI Fl base peptide sequence. 

HLA-DRB 1*0101 15-mers 
Pos 123456789012345 

83 KHKLEVLESQDPGCC 

104 VSYYRKFWLGAVAHA 

86 LEVLESQDPGCCFHE 
103 KVSYYRKFWLGAVAH 
80 HARKHKLEVLESQDP 
99 HEIIKVSYYRKFWLG 

107 YRKFWLGAVAHACNP 

105 SYYRKFWLGAVAH AC 

108 RKFWLGA VAHACNPS 

106 YYRKFWLGAVAHACN 

87 EVLESQDPGCCFHEI 
95 GCCFHEIIKVSY YRK 
98 FHEIIKVSYYRKFWL 
101 1IKVSYYRKFWLG A V 
110 FWLG A VAH ACNPSTL 



SEQ ID NO: 



score 



31 Portion of SEP 



22 
20 



ID NO: 5: each 
start position is 
specified, the 
20 length of each 

19 peptide is 15 

amino acids, the 
end position for 
19 each peptide is 

18 the start position 

18 plus fourteen 

17 
16 
16 
16 
16 
16 



HLA-DRB 1*0301 (DR17) 15-mers 

215 



SEQ ID NO: 



I 



511582003440 



Pos 


1 234567890 1 2345 


score 


95 


GCCFHEIIKVSY YRK 


24 


101 


llKVo i YKIvr WLOAV 


Z4 


99 


HEI IKVSY YRKF WLG 


20 


87 


E VLESQDPGCCFHEI 


19 


112 


LGAVAHACNPSTLGG 


16 


85 


KLE VLESQDPGCCFH 


13 



Portion of SEP 
ID NO: 5; each 
start position is 
specified, the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



HLA-DRB 1*0401 (DR4Dw4) 15-mers 



Pos 


1 234 567890 1 23 45 


score 


109 


KFWLG AVAHACNPST 


26 


112 


LGAVAHACNPSTLGG 


26 


104 


VSYYRKFWLGAVAHA 


22 


83 


KHKLEVLESQDPGCC 


20 


98 


FHEIIKVS Y YRKF WL 


20 


95 


GCCFHEIIKVSYYRK 


16 


107 


YRKFWLGAVAHACNP 


16 


108 


RKFWLG AVAHACNPS 


16 


101 


IIKVSYYRKFWLGAV 


14 



SEQ ID NO: 



Portion of SEP 
IPNP: 5: each 
start position is 

specified, the 
length of each 

peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 

plus fourteen 



Pos 


1234567890 12345 


score 


95 


GCCFHEIIKVSYYRK 


24 


109 


KFWLG AVAHACNPST 


20 


83 


KHKLEVLESQDPGCC 


19 


103 


KVSYYRKFWLGAVAH 


16 


107 


YRKFWLGAVAHACNP 


16 


98 


FHEIIKVS YYRKFWL 


14 


101 


IIKVSYYRKFWLGAV 


14 



HLA-DRB1*1101 15-mers SEQ ID NO: 

Portion of SEP 
ID NO: 5; each 
start position is 
specified, the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 

Class II 15-mer analysis of amino acids 1-20 (MKCKMELSEGSQKHASLQKS) of 121P1F1 splice variant lb. 
Listed are those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from 
the analysis of the full length 12 1 PI Fl base peptide sequence. 

10 

HLA-DRB1*0101 15-mers SEQ ID NO: 

Pos 123456789012345 score 

2 KCKMELSEGSQKH AS 18 Portion of SEQ 

ID NO: 7; each 
start position is 
specified, the 
length of each 
peptide is 1 5 
216 amino acids, the 

end position for 
each peptide is 
the start position 
plus fourteen 



511582003440 



HLA-DRB1*0101 15-mers SEQ ID NO: 

Pos 123456789012345 score 

4 KMELSEGSQKHASLQ 17 



Pos 



HLA-DRB1*0301 (DR17) 15-mers 
123456789012345 



score 



KMELSEGSQKHASLQ 



18 



SEQ ID NO: 

Portion of SEP 
ID NO: 7: each 
start position is 

specified, the 
length of each 

peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 

plus fourteen 



HLA-DRB 1*0401 (DR4Dw4) 15-mers 

Pos 123456789012345 
4 KMELSEGSQKHASLQ 



score 
20 



KCKMELSEGSQKHAS 



14 



SEQ ID NO: 



Portion of SEP 
ID NO: 7: each 
start position is 
specified, the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



Pos 



HLA-DRB1*1101 15-mers 
1 2 34 567890 1 2 3 45 



score 



MELSEGSQKH ASLQK 



14 



SEQ ID NO: 

Portion of SEP 
ID NO: 7; each 
start position is 
specified. the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



10 



Class II 15-mer analysis of amino acids 104-122 (QKS1EKAKIGRCETAKQIK) of 121P1F1 splice variant 2. 
Listed are those alleles and peptides in which the score falls within the top 50% (rounded up) of the scores from 
the analysis of the full length 121P1F1 base peptide sequence. 



HLA-DRB 1*0401 (DR4Dw4) 15- 
mers 



SEQ ID NO 

217 



511582003440 



Pos 123456789012345 score 

106 SIEKAKIGRCETAKQ 18 Portion ofSEQ 

ID NO: 9: each 
start position is 
specified. the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 

104 QKSIEKAKIGRCETA 14 plus fourteen 



HLA-DRB1*1101 15-mers SEQ1DNQ: 
Pos 123456789012345 score 

Portion of SEP 
ID NO: 9: each 
start position is 

specified, the 

length of each 

peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 

106 SIEKAKIGRCETAKQ 14 plus fourteen 

Class II 15-mer analysis of amino acids 142-170 (EKYKDCDPQVVEEIHNIFA IKSWAKRKFG) of 121P1F1 
splice variant 3. Listed are those alleles and peptides in which the score falls within the top 50% (rounded up) of 
the scores from the analysis of the full length 121P1F1 base peptide sequence. 



Pos 

152 
149 
155 



HLA-DRB1*0101 15-mers 
1 2 3 4 567 890 1 234 5 
VEEIHNIFAIKS WAK 
PQVVEE1HNIFAIKS 
IHNIFAIKS WAKRKF 



score 
31 

22 
20 



148 



DPQVVEEIHNIFAIK 



17 



SEQ ID NO: 

Portion of SEP 
ID NO: 11: each 
start position is 
specified. the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



10 



HLA-DRB 1*0301 (DRI7) 15-mers 
Pos 123456789012345 
148 DPQVVEEIHNIFAIK 
142 EKYKDCDPQV VEE1H 



score 
21 
18 



218 



SEQ ID NO: 

Portion of SEP 
ID NO: 11; each 
start position is 
specified, the 
length of each 
peptide is 1 5 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



511582003440 



HLA-DRB 1 *030 1 (DR 17)15- mers SEQ ID NO: 

Pos 123456789012345 score 

149 PQV VEEIHNIFAIKS 17 





HLA-DRB 1*0401 (DR4Dw4) 15- 






mers 




Pos 


123456789012345 


score 


149 


PQV VEEIHNIFAIKS 


26 


155 


IHNIFAIKSWAKRKF 


26 


148 


DPQVVEEIHNIFAIK 


20 


152 


VEEIHNIFAIKSWAK 


20 




HLA-DRB1*1101 15 -mers 




Pos 


1 2 3 456 7890 12 3 45 


score 


152 


VEEIHNIFAIKSWAK 


19 


149 


PQVVEEIHNIFAIKS 


18 


7 


DPQVVEEIHNIFAIK 


15 


13 


EIHNIFAIKSWAKRK 


14 


14 


IHNIFAIKSWAKRKF 


13 



SEQ ID NO: 



Portion of SEP 
ID NO: I keach 
start position is 
specified. the 
length of each 
peptide is 1 5 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



SEQ ID NO: 

Portion of SEP 
ID NO: II: each 
start position is 
specified, the 
length of each 
peptide is 15 
amino acids, the 
end position for 
each peptide is 
the start position 
plus fourteen 



219 



